


А Guide to 
the World's Languages 


VOLUME 1: CLASSIFICATION 


А Guide to 


the World's Languages 


Volume 1: Classification 


Merritt Ruhlen 


With a Postscript on recent developments 


= 


Stanford University Press 
Stanford, California 


Stanford University Press 
Stanford, California 


@ 1987, 1991 by the Board of Trustees of the 
Leland Stanford Junior University 

Printed in the United States of America 
Original printing 1987 

Printing with Postscript 1991 

Last figure below indicates year of this printing: 


97 96 95 94 93 92 91 


Library of Congress Cataloging-in-Publication Data 


Ruhlen, Merritt, 1944— 

A guide to the world's languages / Merritt Ruhlen. 

. cm. 

Шад bibliographical references and indexes. 

Contents: v. 1. Classification, with a postscript. 

ISBN 0-8047-1894-6 (pbk. : v. 1 : alk. paper) 

1. Language and languages—Classification. I. Title. 
P203.R8 1991 
401'.2—dc20 90-21705 

CIP 


То Joseph Н. Greenberg 


Preface 


This volume is the first in а projected series of three, which when complete 
will constitute a general guide to the world's languages: 


Volume 1: Classification 
: Volume 2: Language Data 
Volume 3: Language Universals 


This first volume discusses the history and present status of the genetic 
classification of the world's languages and offers a complete classification. 
Volume 2 will provide certain kinds of linguistic and nonlinguistic informa- 
tion (e.g. location, number of speakers, genetic classification, consonants, 
vowels, pronouns, syntax) for a broad sample of the world's languages, and 
Volume 3 will survey the basic linguistic patterns that characterize human 
language around the world. 

This first volume provides both a historical perspective on the develop- 
ment of (linguistic) genetic classification and a current assessment of the 
state of our knowledge. (Throughout this book I shall use the terms "ge- 
netic classification" and "linguistic classification" in place of the seman- 
tically more accurate, but stylistically awkward, "linguistic genetic classifi- 
cation.") Typological classification, based on structural similarities between 
languages, is not considered in this volume, though it will be treated in de- 
tail in Volume 3. Similarly, the historical implications of the present classifi- 
cation will also be explored in Volume 3. 

Since human language impinges on so many related academic fields, 
and holds as well a general fascination for an even greater number of inter- 
ested laymen, these volumes do not presume any special training on the 
part of the reader. In particular, this first volume does not assume any back- 
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ground in linguistics, genetics, or taxonomy. All that is required is a curi- 
osity about how the world's languages are related, and about how this infor- 
mation has been discovered during the past two centuries. A few technical 
terms are introduced as needed, but the basic ideas are simple enough that 
the reader should encounter little difficulty. 

Again with the aim of making this volume as accessible as possible to the 
interested layman, I have provided section-by-section topical bibliographies 
instead of a single comprehensive bibliography at the back of the book. By 
this system, the reader interested in, say, Nilo-Saharan, can easily find all 
the listed books and articles on that one family. Care must be taken, how- 
ever, in tracing a particular citation in the text to the proper bibliography 
(and indeed to the proper section of that bibliography, since several bibli- 
ographies are broken into subsections). 

АП the linguistic maps in this volume are my own work, prepared from 
base maps published by the U.S. Geological Survey. 

A brief sketch of the origin of this volume is perhaps in order. I first be- 
came aware of the need for a general guide to the world's languages while I 
was a graduate student at Stanford University from 1967 to 1973, and espe- 
cially while I was working on Stanford's Language Universals Project from 
1971 to 1977. During this period I compiled a Guide to the Languages of the 
World (Stanford, Calif., 1975), a self-published work containing information 
on roughly 700 languages. That book eventually sold some 2,500 copies, 
was used as a textbook at a number of universities in this country and 
abroad, and came to the attention of Stanford University Press, which ex- 
pressed an interest in publishing a revised version. 

From 1977 to 1980 I devoted my full time to collecting information on as 
many languages as possible, and the 7,000 data sheets І accumulated dur- 
ing that period contain information on about 2,000 different languages. 
These 2,000 languages are identified by an asterisk in the complete classifi- 
cation given in section 8.6, and the information itself will be found in the 
forthcoming Volume 2. 

Since I had decided to arrange the information genetically (so that lan- 
guages and groups would appear nearest the languages and groups to 
which they were most closely related) rather than alphabetically, as in the 
1975 version (which made it easy to find individual languages, but difficult 
to perceive what traits characterized Romance, Germanic, Bantu, Polyne- 
sian, or any other language family), I had by late 1979 put together an over- 
all classification of the world's languages that was based largely on C. F. Voe- 
gelin and F. M. Voegelin, Classification and Index of the World's Languages 
(New York, 1977). It was my intention to arrange the information I had col- 
lected in terms of this classification. 

In November 1979 conversations with Donald Laycock and Darrell Tryon 
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at the Australian National University іп Canberra made me aware that both 
the Papuan and the Austronesian classifications given іп the Voegelins' 
book had already been largely superseded by more recent work, and they 
proposed that I revise my classification. They also suggested pertinent litera- 
ture to consult and other scholars I might contact for advice on specific 
groups. 

In 1980 I decided that if I was going to revise the Papuan and Austrone- 
sian sections of the classification, I should also attempt to consult experts in 
other language families and solicit their criticism of the classification I was 
then using. Correspondence with more than 100 scholars as well as a care- 
ful review of the taxonomic literature during the early 1980's led to contin- 
ual refinements and revisions of the classification. Furthermore, the addi- 
tion of a historical sketch and current appraisal of each family eventually 
metamorphosed what began as a single chapter into the present volume. 
The linguistic information that formed the original focus of this Guide will 
now appear in Volume 2, and Volume 3 will explore the implications of the 
data presented in the first two volumes. 

In addition to providing a complete classification of the world's lan- 
guages and a historical discussion of how, when, and by whom this knowl- 
edge was discovered, this volume has a third goal, namely, to examine the 
spectacular taxonomic career of the American linguist Joseph Greenberg, 
who has undoubtedly clarified the relationships of the world's languages 
more than any other scholar ever has—or ever will. Greenberg fills the 
same niche in linguistic classification that Linnaeus does in biological tax- 
onomy. While other linguists were content to pursue their careers working 
out minute details of small, closely related language families, apparently 
believing that this patchwork approach would one day knit itself into an 
overall classification of the world's languages, Greenberg had the audacity 
to classify, by himself, all the languages of Africa, the languages of New 
Guinea, and the languages of North and South America. His rejection of 
established dogma in African and Amerindian classification has been met 
with hostility and misguided attacks by a number of traditional scholars. 
Nevertheless, on questions of taxonomy he has seldom been found to be 
wrong. It has been my rare good fortune to have known Joseph Greenberg 
as a teacher, colleague, and friend over a period of almost 20 years, and it is 
with both admiration and affection that I dedicate this book to him. 

Many people have helped me in quite varied ways in writing this vol- 
ume, and [ wish to express my appreciation to them all. First and foremost 
I must thank my parents, who made everything possible, and my wife 
Anca, who supported my writing of this volume in ways too numerous to 
mention. 

Imust also acknowledge the extraordinary support that Stanford Univer- 
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sity Press has provided me during the lengthy gestation period of this vol- 
ume. William Carver, in particular, has guided and encouraged my work in 
a multitude of ways, and his contributions have truly been substantive, or- 
ganizational, and stylistic. His constant support and ability to solve any ap- 
parent problem are greatly appreciated. J. G. Bell also provided able counsel 
from time to time, and his advice was always taken most seriously. I would 
also like to thank my two fine copy editors, Kathleen Much and Barbara 
Mnookin, who eliminated numerous infelicities and errors from the origi- 
nal manuscript and rendered the final product both clearer and more 
readable. 

A third group of people whose contributions to this volume have been 
enormous, and are deeply appreciated, are the more than 100 scholars 
(most of whom I have never met) who generously offered me advice in try- 
ing to assemble the best classification possible on the basis of current 
knowledge. Their suggestions have had a profound impact on the final 
shape of the classification, and I wish to express my sincere gratitude to 
them all. Their names, together with their areas of expertise, are listed on 
pp. xix-xx, below. I must emphasize, however, that none of these spe- 
cialists necessarily agrees with the views expressed in this volume or with 
the final classification, for which I assume sole responsibility. Those schol- 
ars who have supplied me with their own unpublished classifications and 
have allowed me to use them in this book deserve special thanks: Gabriel 
Manessy (Gur), M. Lionel Bender (Nilo-Saharan), Robin Thelwall (Nubian 
and Daju), Alexander Militarev (Berber), Robert Jones (Karen), Gérard 
Diffloth (Austroasiatic), Lawrence Reid (Philippine languages), Paul Black 
(Australian), Joseph H. Greenberg (Amerind), and Eugene Loos (Panoan). 
I would also like to thank Sydney Lamb for reading the entire manuscript 
and making a number of useful suggestions, and L. L. Cavalli-Sforza, for 
his illuminating conversations on biological taxonomy, genetics, and re- 
lated areas. 

Finally І would like to thank a number of personal friends who have 
helped me out in various and sundry ways: Moises Moreno, Russell Ruhlen, 
Victor Sapojnikoff, Michael Saunders, Michael Sullivan, Andrew Wilson, 
and John Wilson. 

M.R. 
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А Note on Notation 


Although this volume does not employ a great deal of esoteric notation, 
some of its conventions and symbols may be unfamiliar to the general 
reader and so require a few words of explanation. 

1. A dagger (7) identifies those languages, or language families, that are 
now extinct. 

2. An asterisk (*) is used in two ways. Usually it identifies reconstructed 
(but historically unattested) words, e.g. Indo-European *akva-s “horse.” The 
second use of the asterisk is restricted to Chapter 8, where it identifies 
those languages whose sound system will in due course be found in Vol- 
ume 2, e.g. Таіти]. 

3. A right arrow (->) is used to indicate that one language (or family) 
name has been replaced by another. For example, Barea — Nera means that 
the language frequently referred to as Barea is called Nera in this volume. 

4. An equals sign (=) is used to point out alternate language (or family) 
names that for reasons discussed in section 8.1 are sometimes better known 
than the name adopted here, e.g. Nama (-- Hottentot). 

5. Brackets (|. . .]) enclose phonetic transcriptions, e.g. [dori] ‘to wish 
for, desire.' In Chapter 8 the number of languages in any group is indicated 
in brackets, following the name of the group; e.g. CELTIC [4] means there 
are four extant Celtic languages. 

6. Single quotes ('. . .") enclose English glosses of foreign words, e.g. 
French chien ‘dog.’ 

7. Double quotes (“. . .”) are used to indicate direct quotation from a 
source or to call attention to a word or phrase, e.g. there are no “primitive” 
languages. 
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8. Italics are used for non-English words (e.g. French chien ‘dog’), for 
individual letters in running text (treated as phonetic elements, whether in 
English or not), and for simple emphasis (e.g. typological traits like gender). 

9. Names of language groups are given in capital letters (e.g. SLAVIC, 
ROMANCE) in the lists and in the classification itself; names of single lan- 
guages are in capital-and-lower-case letters (e.g. Russian, English, Ohlone). 
In the case of single languages that are themselves independent branches of 
a family, either notation may be used, e.g. ALBANIAN or Albanian. 

10. The less-than sign (<) indicates that a word, sound, or meaning de- 
rives historically from another, e.g. French chien < Latin cane(m); French š 
< Latin k. 

11. -The greater-than sign (>) indicates that a word, sound, or meaning 
has changed into another over the course of time, e.g. Latin cane(m) > 
French chien; Latin k > French š. 

12. The similar-to sign (—) indicates either (a) that certain forms alter- 
nate morphologically in a language, e.g. English sing — sang — sung, Indo- 
European *ped- ~ *pod- ‘foot,’ or (b) that variant forms of a root are found in 
different languages of a language family, e.g. Nilo-Saharan tok — tek — dik 
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FAMILY X: 
I PRIMARY BRANCH 
П PRIMARY BRANCH 
Ш PRIMARY BRANCH: 
A GROUP 
B GROUP: 
1 SUBGROUP 
2 SUBGROUP 
3 SUBGROUP: 
a BRANCH 
b BRANCH: 
i SUBBRANCH 
ii SUBBRANCH: 
a GROUP 
В GROUP 


13. A hyphen (-) is sometimes used to separate a root from its affixes, 
either prefixes (e.g. English pre-fix) or suffixes (e.g. Greek dído-mi Т give’). 
It is also used with prefixes and suffixes by themselves (e.g. the English 
prefix pre-, the Greek suffix -mi Т) or following a root (e.g. Indo-European 
*ped- ~ *pod- ооё). 

14. The system of representing genetic tree structure is explained in 
section 8.2, but because such trees occur in lists throughout the book, a 
brief outline of the notation is given here. Genetic classifications are often 
represented in terms of branching diagrams, as in the figure on the fac- 
ing page. 

In this book such structures are represented by a system of indentation, 
where each successive level of branching is indicated by indentation to the 
right and each node is preceded by a letter or number to identify the taxo- 
nomic level of the group. Accordingly, the branching-tree diagram in the 
figure will have the representation shown in the list above. 


А Guide to 
the World's Languages 


VOLUME 1: CLASSIFICATION 


Introduction 


Classification, or taxonomy, is a fundamental pursuit of science, an indis- 
pensable first step in the search for understanding. Yet it is an area that 
tends to provoke fascination and deep interest in only a minority of schol- 
ars, and general boredom or lack of interest in the rest, be they biologists or 
linguists. In the development of comparative linguistics in nineteenth- 
century Europe, taxonomy was paramount, and substantial results were 
achieved, not only for Europe, but for the rest of the world as well. In the 
twentieth century a static "ahistorical" perspective has dominated lin- 
guistics, and taxonomy has been relegated to "historical linguistics" in the 
minds of many scholars. But despite its general neglect, discoveries since 
1950 have been even more spectacular, and more far-reaching in their con- 
sequences, than the truly important results of the nineteenth-century pio- 
neers. It is the aim of this volume to trace the history of language classifica- 
tion from its earliest beginnings to the present day, and to offer (in Chapter 
8) a complete classification of the world's roughly 5,000 languages. 

It is possible to classify languages in many ways, depending on the crite- 
ria chosen. One might, for example, divide the world's languages into two 
groups: those whose names begin with the letter P (e.g. Polish, Portuguese) 
and those that do not (e.g. English, Chinese). Such classifications are of 
course of no intellectual significance. Two kinds of classification have 
proved to be of great importance in the study of human language. Ty- 
pological classification categorizes languages in terms of structural proper- 
ties. For example, languages may be classified according to the number and 
kinds of vowels they use, or according to the order of the subject, verb, and 
object in a simple declarative sentence. Typological classification is impor- 
tant in many regards. Not only does it reveal that there are certain favored 
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patterns in linguistic structure (e.g. the і, е, а, о, и vowel system), but it 
also allows us to perceive how such favored patterns influence, and are 
influenced by, other patterns over time. А typological classification thus 
offers us a window on the evolution and nature of language that transcends 
individual languages. Typological classification will figure prominently іп 
the third volume of this Guide, but it is not the subject of this one. 

Genetic classification, according to which languages are classified on the 
basis of common origin, is the second important kind. The idea is quite 
simple and is illustrated in the following example. Two thousand years ago 
Latin was spoken across a broad expanse of Europe by citizens of the Ro- 
man empire. With the destruction of the empire, Latin speakers were iso- 
lated from one another by invading Germanic, Slavic, Arabic, and Hun- 
garian tribes. Gradually, over the centuries, the Latin spoken in what is 
today Rumania diverged from that spoken in present-day Italy, which in 
turn was increasingly different from the Latin spoken in present-day France 
or Spain. Each dialect continued to evolve along its own independent path, 
and the result is the contemporary Romance languages: Rumanian, Italian, 
French, Catalan, Spanish, and Portuguese, among others. Such genetic 
classifications, which trace the genealogy of a language family (Romance) 
from the parent language (Latin) to the daughter languages (Rumanian, 
Italian, etc.), are usually represented as “tree diagrams” (which are really 
inverted trees, with the branches pointing down), as here: 


T SRL 
Portuguese Spanish Catalan French Italian Sardinian Rumanian 


Such tree diagrams are an attempt to represent the historical develop- 
ment of a family of languages. The evidence offered to establish such a fam- 
ily is usually a set of etymologies in which each etymology contains forms 
from different languages that are both semantically and phonetically simi- 
lar, and whose similarities are explained by the hypothesis of common ori- 
gin. Continuing with our Romance example, one might cite Rumanian 
mind, Sardinian manu, Italian mano, French main, Catalan mà, Spanish 
mano, and Portuguese mão, all of which mean ‘hand.’ In this case the hy- 
pothesis of common origin is confirmed by Latin manus 'hand,' but we are 
seldom so fortunate as to have historically attested forms of the parent 
language. 

For the vast majority of the world's languages, nothing is directly known 
of the ancestor languages from which they derive, simply because these an- 
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cestor languages, unlike Latin, were never written down. In such cases we 
can often infer what the original form оҒа word must have looked like from 
the evidence of the surviving daughter languages, but such reconstructions 
are only approximate, because of the incompleteness апа ambiguities of the 
available evidence. Fortunately, the recognition of valid genetic groupings 
does not depend on the reconstruction of earlier forms. 

The membership of much larger language stocks—families of families— 
can also be deduced. It can be shown, for example, that the Romance fam- 
ily is but one branch of a larger and more ancient family that includes Ar- 
menian and Albanian, as well as Slavic, Germanic, Greek, Celtic, and Indic 
languages, among others. This larger group is known as the Indo-European 
(or Indo-Hittite) family. Although Indo-European reconstructions continue 
to undergo revision (and sometimes wholesale reevaluation) from time to 
time, the validity of the family itself is no longer questioned. 

How many such larger groups are there? Disagreements persist, but 
during this century it has proved possible to reduce the number of distinct 
families to which the world's 5,000-odd languages belong to between 10 and 
20. It has not yet been possible to demonstrate that all languages derive 
from a common source, though such a possibility cannot be ruled out. Nor 
is it possible to say where, when, or how human language developed. The 
problems posed by the origin of language are discussed, inconclusively, in 
section 7.4. 

The general plan of this volume is as follows. The first chapter offers а 
brief introduction to the principles of genetic classification. After first plac- 
ing the problem of genetic classification in its proper historical context, 
Chapter 2 traces the development of linguistic classification in Europe. 
Chapters 3—6 treat Africa, Asia, Oceania, and North and South America in 
a similar fashion. Chapter 7 assesses the prospects for future research, 
after first summarizing the methodological errors that must be avoided if 
progress is to be made in either large-scale groupings or low-level refine- 
ment. The problem of the origin and evolution of human language is also 
considered. Finally, Chapter 8 presents a complete classification of the 
world's languages, in three levels of detail: a brief index (section 8.4), a list 
of major genetic groups (8.5), and the complete classification (8.6). 

But again, before turning to the history of genetic classification in Chap- 
ter 2, itis important that we develop a few basic notions concerning genetic 
classification, so that the reader may better understand the various contro- 
versies to be discussed in later chapters. Thus the reader who approaches 
this volume with an interest in some particular part of the world is advised 
to read Chapter 1 before proceeding to his or her area of interest. Chapters 
2—6, by contrast, may to a large extent be read independently of one an- 
other. It is also advisable to read Chapter 1 before examining the classifica- 
tion in Chapter 8. 
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Genetic Classification: 
Principles and Methods 





The idea that groups of languages that share certain systematic resem- 
blances have inherited those similarities from a common origin is the basis 
for genetic classification. It is a simple, yet profound, idea, and seems today 
so obvious it is hard to imagine that it was not generally recognized until 
the late eighteenth century. As we shall see in Chapter 2, however, when 
the idea did finally penetrate the intellectual ethos, it triggered an explo- 
sion of research on the relationships of European languages that may be 
said to have inaugurated the modern scientific study of human language. In 
fact, in broadest terms, it would not be incorrect to view the entire develop- 
ment of linguistic science in nineteenth-century Europe as revolving around 
the gradual elucidation of this fundamental precept. 


1.1 DEFINITIONS AND METHODS 


Arguments over genetic classification usually concern either (1) the defi- 
nition of genetic classification or (2) the method of "arriving at" a genetic 
classification. Let us consider first the question of definition. 


Definition 1: A genetic classification is a subgrouping of all relevant lan- 
guages into genetic nodes. 


As illustrated in Fig. 1.1, a subgrouping is simply a branching diagram 
that subdivides the maximal domain A (i.e. all of the relevant languages) of 
the classification into smaller subgroups, or nodes: B, С, D, Е, ... 
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/ N/ AKA ЛА 


Fig. 1.1. Examples of subgroupings 


Definition 2: А genetic node is a group of languages each of which is more 
closely related to the others in that group than to any language outside 
the group. 


For the sake of lexica] variety I shall use the equivalent terms (genetic) 
node, unit, and group interchangeably. 

A genetic classification thus makes two statements. First, it affirms that 
certain languages are in fact related to each other (i.e. share a common an- 
cestor). Second, it specifies how the languages are interrelated in the form 
of a branching diagram. Consider the classification described by Fig. 1.2. 
This subgrouping indicates not only that a, b, c, d, and e are related lan- 
guages, butalso that a and b are most closely related to each other, as are c, d, 
and e. Іп fact, a and b constitute the genetic node B, and c, d, and e consti- 
tute the genetic node C. Finally, B and C together compose a "higher" node 
A, which is the maximal domain of this genetic classification. Another ge- 
netic classification might also claim that the same languages a, b, c, d, and e 
are related, but might propose a very different subgrouping, like that in 
Fig. 1.3. And of course many other subgroupings are both reasonable and 
possible. 


А 


> 
(ег) 
б 


a b c d e a b c d e 


Fig. 1.2. A typical subgrouping Fig. 1.3. A possible alternative subgrouping 
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Discovering the correct subgrouping is usually more difficult than simply 
determining when a group of languages is related, and for this reason more 
often leads to controversy. As we shall see below (see Table 1.1), even a 
modest amount of lexical evidence allows one to distinguish Indo-European 
(І-Е) from non-Indo-European languages, and this distinction is so sharp 
that there are in fact no extant languages whose membership in (or exclu- 
sion from) I-E is in doubt. There is thus strong consensus on which lan- 
guages make up the I-E family, and the same is true of the Romance, Slavic, 
and Germanic subfamilies. But the subgrouping of the languages within 
the I-E family (or within the Romance, Slavic, or Germanic subfamilies) is 
far from settled and remains even today a subject of scholarly debate. 


1.2 CONTINUOUS AND DISCRETE VARIATION 


One consequence of the fact that all languages are continually changing 
is that forms of speech that are mutually intelligible (i.e. dialects of one lan- 
guage) will gradually drift apart, provided only that they are isolated from 
each other for a sufficient period of time. Although languages are known to 
change at different rates during their histories (for reasons that are not com- 
pletely clear), under most circumstances 500 years is probably sufficient to 
seriously impair mutual intelligibility, and 1,000 years will usually obliter- 
ate it entirely. Drift of this sort is of course gradual, and except where geo- 
graphic or political barriers separate related neighboring tongues, there is 
likely to be a continuum of intermediate dialects in the borderlands. In fact, 
one of the principal problems in the subgrouping of languages involves this 
distinction between continuous and discrete variation. Within the Indo- 
European language family, the Romance, Slavic, and Germanic subfamilies 
are discrete groups. There are no languages that are ambiguous as to classifi- 
cation at this level; that is, there are no languages that appear to be half- 
Romance and half-Slavic. Rumanian, because it contains a sizable number 
of Slavic loanwords, is sometimes cited as such a hybrid, but in fact even 
a cursory examination reveals it to be overwhelmingly Romance in basic 
vocabulary, morphology, and syntax. The Slavic overlay is no more pro- 
nounced than the French overlay in English, which is also the result of 
large-scale borrowing. 

Within closely related groups of languages (e.g. Romance, Slavic, Indic, 
Polynesian, or Bantu), however, one often encounters continuous variation. 
For example, historically at least, both Italian and French comprised nu- 
merous regional dialects that changed gradually as one traveled from vil- 
lage to village, from a form of speech that was clearly Italian to one that was 
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unmistakably French. Although all villages could communicate with ease 
with neighboring villages, the end points (i.e. French and Italian) of this 
dialect chain were not mutually intelligible. Let us represent this situation 
as follows: 


(1) French a b c d e f g Italian 


Неге a-g represent transitional forms of speech between Italian and French. 
Furthermore, let us assume for the sake of this example that only con- 
tiguous dialects are mutually intelligible; thus people in village c can com- 
municate only with people in villages b and d. The problem that this ex- 
tremely common phenomenon poses for the taxonomist is the following: 
although we would like to put "French dialects" in one group and "Italian 
dialects" in another, there is no non-arbitrary way of doing so in every case. 
In formal terms, what this means is that no subset of the dialects shown in 
(1) constitutes a genetic node as defined in definition 2 above. 

To see why this is so, let us assume someone claims that French and dia- 
lects a-c constitute one genetic node and Italian and dialects d—g consti- 
tute another, as shown in Fig. 1.4. But dialect c is as close to dialect d as it is 
to dialect b, and it is in fact closer to dialect d (with which it is mutually 
intelligible) than it is to dialect a, even though dialect a is a member of the 
same genetic node. Obviously, our definition of genetic node is violated for 
this subgrouping, since French and dialects a, b, and c do not constitute "a 
group of languages each of which is more closely related to the others in 
that group than to any language outside the group." 

Two points should be kept in mind. First, the problem of classifying dia- 
lects is formally the same as classifying languages; and the problem of clas- 
sifying languages is the same as classifying groups of languages. Second, 
continuous and discrete variation are merely two facets of the same phe- 
nomenon. The transformation of dialects into distinct languages, and of 
continuous variation into discrete variation, are both the result of commu- 
nicative isolation and the passage of time. These two factors alone are suffi- 
cient to transform continuous dialects into discrete languages, and discrete 
languages into discrete groups of languages. Nevertheless, since continu- 
ous variation is more apparent at the lower levels of classification, the prob- 
lem of subgrouping becomes progressively more difficult as the languages 


French, a, b, c d, e, f, g, Italian 


Fig. 1.4. A proposed subgrouping of French and Italian dialects 


8 1 PRINCIPLES ОЕ CLASSIFICATION 


to Бе compared become more апа more alike, until, as іп the example just 
discussed, it becomes impossible to establish a non-arbitrary subgrouping. 
In the case of the Romance languages, more than 1,500 years of commu- 
nicative isolation have given the Eastern forms of Romance (i.e. the Ruma- 
nian dialects) a specific character that sets them apart from the rest of Ro- 
mance. Most of the Western Romance idioms, however, show no such 
sharp cleavage into distinct groups, a conseguence of there having been 
continuous contact, at least between neighboring languages, during the 
time of total isolation from Eastern Romance. The impression given by 
Western Romance, then, is that of one language gradually shading into the 
next, with few clear-cut boundaries. Subgrouping within Western Ro- 
mance thus becomes arbitrary, and the lines of division often assume a geo- 
graphic or political basis rather than a strictly linguistic one. The traditional 
subgroups of Western Romance (i.e. Italo-Romance, Rhaeto-Romance, 
Gallo-Romance, and Ibero-Romance) are to be understood in this sense. 
Identical problems are often encountered in grouping dialects into lan- 
guages. For example, let us assume there are five dialects related as in Fig. 
1.5, where a line connecting two dialects indicates mutual intelligibility. 
Since all of these dialects are mutually intelligible with dialect A, this dia- 
lect is taken to be the "norm" of the language, and dialects B—E are said to 
be dialects of this language even though they themselves are not mutually 
intelligible. The choice of A as the norm here is likely (but not certain) to 
accord with the realities of the linguistic origins of these dialects, but in an- 
other case (see Fig. 1.6) no such convenient choice is available. Here there is 


B C 

C D E 
>< 

Е G H 


Fig. 1.5. Dialects that may be represented Fig. 1.6. Dialects that тау be represented 
as a single language ав multiple languages 


D 
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no single dialect mutually intelligible with all other dialects. In cases like 
this linguists often can do no better than to divide the dialects into groups 
such that within each group there is at least one dialect intelligible with all 
other dialects in that group. In Fig. 1.6 language À might be defined as dia- 
lects A, B, C, and E and language F as D, F, G, and H. The arbitrary nature 
of such a subgrouping— several others would be as reasonable—should be 
evident. Moreover, in the real world the choice of a dialect to represent the 
norm of a language (a serious problem in countries trying to develop a 
standard national language) is often influenced by nonlinguistic factors. 
For instance, a dialect with numerous speakers or one enjoying high pres- 
tige— whatever its actual linguistic affinities for the other dialects—is much 
more likely to be chosen as the norm for a language than one with few 
speakers or low prestige. 


1.3 GENETIC AFFINITY 


Having defined the notion of genetic classification, and having discussed 
a few characteristic problems in identifying genetic nodes, let us now ex- 
amine how one actually goes about discovering a genetic classification. The 
process involves two stages. During stage one the investigator seeks to es- 
tablish "evidence" that certain languages are in fact related; during stage 
two "other evidence" is adduced for a particular subgrouping of the lan- 
guages found to be related in stage one. The nature of the evidence re- 
quired (or, as some might say, permitted) for proof of genetic affinity (stage 
one) or for proof of a subgrouping (stage two) is often at the center of classi- 
ficatory controversies. 

In attempting to ascertain genetic relationships, where does one start? 
According to Greenberg (1957: 42—43), the oldest and still "the swiftest and 
surest method" is the comparison of basic vocabulary for a large number of 
languages from some broad geographic area. Inevitably the languages com- 
pared will be found to group themselves into subgroups on the basis of sys- 
tematic resemblances. Consider, for example, Table 1.1 (adapted from 
Greenberg 1957: 43), which shows the phonetic form of just nine basic 
words in 14 European languages. Yet even this meager evidence is sufficient 
to demonstrate that these 14 languages are to be divided into five groups 
(A, B, C, D, E), and furthermore that A, B, and C exhibit resemblances 
among themselves in which neither D nor E participate. The languages in 
group A are Germanic languages; those in group B, Romance languages; 
and those in C, Slavic languages. All three are branches of the Indo-Euro- 
pean family. Group D includes two Finnic languages, and group E is the 
well-known language isolate Basque. 


Table 1.1. Comparative Vocabulary of European Languages 





Word 
Language one two three head eye ear nose mouth tooth 
A Swedish en tvo tre hyvud ega øra næsa mun tand 
Dutch еп tvé dri hoft Ox or nos mont tant 
English won tuw erij hed aj ijr nowz тауу0 tuwo 
German ajns tsvaj draj kopf auga бг naze munt ізап 
В French ææ dø tRwa tet сеј oRsj ne buš da 
Italian uno due tre testa okjo orekjo naso boka dente 
Spanish uno dos tres kabesa Oxo огеха naso boka djente 
Rumanian un doj trej kap okj ureke nas gurə dinte 
C Polish jeden dva tši glova oko ихо nos usta тор 
Russian adin dva tri galava oko UKO nos rot zup 
Bulgarian edin dva tri glava oko uxo nos usta zəb 
D Finnish yksi kaksi kolme pe silmæ korva nenæ sū hamas 
Estonian yks kaks kolm pea silm korv nina su hamas 


E Basque bat bi hiryr byry begi belari sydyr aho orts 
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Within each of these groups the resemblances аге so overwhelming that 
they reguire little comment even for the linguistically naive reader. More- 
over, to а linguist familiar both with the precise phonetic meaning of each 
symbol and with the common evolutionary paths that transform one sound 
into another, the resemblances in Table 1.1 are even stronger and more ap- 
parent. The reader may observe how clearly and unambiguously the sup- 
posed “hybrid” Rumanian groups itself with the Romance languages. Note 
also the homogeneity of both the Slavic and Finnic groups. 

The evidence for joining А, В, and C in an Indo-European family is ad- 
mittedly less transparent, yet even here the reader should be able to per- 
ceive significant similarities among all three groups for the words ‘one,’ 
‘two,’ ‘three,’ ‘eye,’ and ‘nose’; and between А and B in their words for ‘ear’ 
and “tooth.” 

Genetic relationship is not, however, the sole source of sound/meaning 
resemblances such as those illustrated in Table 1.1. Three other possible 
causes must be entertained, and eliminated, before one may assert that the 
similarities in question derive from a common origin: chance, sound sym- 
bolism, and borrowing. 

Since the number of distinct sounds used in human languages is rela- 
tively small, and furthermore, since certain sounds are common to (prac- 
tically) all languages (e.g. t, k, m, n, s, i, а, u), inevitably chance resem- 
blances between languages will arise from time to time. Usually such 
historical accidents are restricted to one-syllable words; Hockett (1958: 486) 
points out the similarity of German nass ^wet' and Zuni (an American In- 
dian language of New Mexico) nas ^wet.' Occasionally one may encounter a 
two-syllable word with a chance resemblance, such as Rumanian dori 'to 
wish for, desire' and Lau (an Austronesian language spoken in the Solo- 
mon Islands) dori 'to wish for, desire.' How, one may justifiably ask, do we 
know that this similarity between Rumanian and Lau is nothing more than 
a chance resemblance? The answer is that all other causes may be ruled out 
in this case, leaving chance as the only possibility. First, borrowing is im- 
plausible because there has been no known historical contact between these 
peoples, and the immense geographic distance separating them more or 
less precludes such contact. Second, sound symbolism is eliminated as a 
possible cause because there is no intrinsic relationship between the sounds 
[dori] and the meaning 'desire' that is attested in the world's languages 
(more on this below). Third, genetic relationship may be ruled out by bring- 
ing into consideration other languages. As soon as this is done it becomes 
obvious, from abundant similarities, that Rumanian is a Romance language 
within the Indo-European family, whereas Lau is an Oceanic language 
within the Austronesian family. The cumulative weight of all the relevant 
evidence reveals the isolated, and therefore insignificant, nature of this par- 
ticular resemblance. 
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Тһе whole process of classifying languages into genetic groups оп the 
basis of sound/meaning similarities is predicated on the fundamental а5- 
sumption that the relationship between sound and meaning is arbitrary іп all hu- 
man languages. In general, this assumption holds true, as English ‘dog,’ 
French chien, Spanish perro, and Russian sobaka illustrate. Nevertheless, it 
has long been recognized that for a small group of words the assumption is 
invalid; in these few instances there is, in fact, an intrinsic connection be- 
tween sound and meaning. The most obvious examples are onomatopoeic 
words, such as 'tinkle' and 'buzz' in English. But there are also a few non- 
onomatopoeic words that are known to display sound/meaning similarities 
in languages around the world. The two most famous examples are mama 
for “mother” and papa (or tata — dada) for 'father.' It is also possible that 
there is sound symbolism between the front vowel [i] and the meaning 
‘here, this, and between a non-front vowel [a] or [u] and the meaning 
'there, that.' Such a pattern appears to have a global distribution. 

Since sound symbolism affects such a small percentage of the words in 
any language's lexicon, the elimination of these few items from considera- 
tion in establishing genetic classifications represents more of a theoretical 
nuisance than a practical impediment. In point of fact, neither of the two 
nonhistorical causes of sound/meaning resemblances (i.e. chance and sound 
symbolism) greatly affects genetic classification. If a linguist cannot array 
more evidence (in the form of cross-language lexical similarities) than a few 
(possibly accidental) resemblances and several items known to be affected 
by sound symbolism, he is most unlikely to hazard any kind of guess at all 
about genetic classification. 

The final nongenetic source of lexical resemblances is borrowing. It is 
probably true that whenever two languages come in contact for an ex- 
tended period there will be borrowing of lexical items from one to the 
other, sometimes in both directions. Since relatively few languages have 
existed in such geographic isolation that they could avoid contact with 
other languages, it is understandable that the failure to recognize borrowed 
words has at times led to confusion in attempts at genetic classification. It is 
certainly true that the impact of lexical borrowing on genetic classification 
has attracted a good deal of scholarly attention. Probably too much, since, 
according to Greenberg (1957: 39), “while in particular and infrequent in- 
stances the question of borrowing may be doubtful, it is always possible to 
tell whether a mass of resemblances between two languages is the result of 
borrowing." 

There are two principal techniques for detecting borrowed words. The 
first involves the recognized propensity for lexical borrowing to affect cer- 
tain semantic domains (such as previously unknown cultural items, e.g. 
kangaroo, tobacco, television) and certain grammatical categories (nouns 
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rather than verbs, rarely inflectional or derivational affixes). Although al- 
most any word may be borrowed on occasion, there is no question that 
under most circumstances basic vocabulary (e.g. eye, nose, head, leg, and 
pronouns) is fairly resistant to borrowing. Because of the small, and spe- 
cific, portion of the lexicon that borrowing tends to affect, it is normally not 
difficult to identify both the borrowed words and the donor language or 
languages. 

А second technique for distinguishing borrowed words from genetic 
cognates is the extension of the range of comparison to other languages. 
For instance, let us assume that someone claims language X is genetically 
related to language A on the basis of certain lexical similarities. Let us as- 
sume further that language A is known to be a member of a language family 
that includes languages B, C, D, E, and F. Since X is claimed to be related to 
A, it must also be related to languages B—F. Therefore, one may anticipate 
finding similarities between X and B, X and C, etc., just as one has appar- 
ently discovered cognates between X and A. If, on the other hand, the lex- 
ical similarities turn out to be restricted to X and A, we may be fairly con- 
fident that we are dealing with borrowing. English displays numerous 
lexical similarities with French (e.g. face, poison, poor, ease, change, use, 
village), but very few with the other Romance languages. As a conse- 
quence, even in the absence of historical evidence, it would be clear that 
such words must have been borrowed from French. 

The case of the Tai languages, of which Laotian and Standard Thai are 
the best known, offers an instructive example of how borrowed words may 
temporarily confound genetic classification. It was formerly thought, on 
the basis of phonological similarities (e.g. the presence of tone and mono- 
syllabism) and a limited number of apparent cognates, that the Tai lan- 
guages exhibited a distant genetic relationship with the Chinese languages, 
and hence with the Sino-Tibetan family as a whole. As we shall see, how- 
ever, phonological similarities alone (such as the presence or absence of 
tone) are not reliable indicators of genetic affiliation. Furthermore, a closer 
examination of the supposed cognates revealed that the perceived simi- 
larities were almost uniquely with Chinese, and not with the other Sino- 
Tibetan languages. Today the Tai languages are generally considered to be 
most closely related to the widespread Austronesian family. The apparent 
cognates with Sino-Tibetan have turned out, on closer inspection, to be 
merely early borrowings between Chinese and Tai. 

In establishing genetic affinity between languages, linguists have em- 
phasized lexical similarities. This does not mean, however, that other evi- 
dence is without interest; in principle at least, everything should be taken 
into account, and in fact such factors as similarities in morphological ir- 
regularities may be of extremely high probative value. Nevertheless, lexical 
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information is of paramount importance, especially іп the initial stages of 
classification, because (1) it provides а large number of independent, and 
potentially differing, items; (2) it offers a point of comparison among all 
languages in the sense that certain fundamental vocabulary items may be 
expected to occur іп all languages; and (3) often, especially in areas of the 
world lacking detailed linguistic documentation, lexical information is all 
that is available for comparison. 


1.4 SUBGROUPING 


Let us turn now to the second stage of language classification, the estab- 
lishment of a subgrouping for the languages found to be related during 
stage one. It should be emphasized from the outset that the accuracy of the 
subgrouping obtained during stage two depends on having recognized, 
during stage one, all the relevant languages in a particular genetic unit. 
Where the true breadth of a language family has not been correctly deline- 
ated in stage one, the subgrouping implied by this (partial) information is 
likely to be incomplete, erroneous, or both. Success at stage two is thus cru- 
cially dependent on prior success at stage one. In subsequent chapters we 
shall encounter several examples where a failure to delimit correctly the ex- 
tent of a genetic group of languages has undermined the proposed sub- 
grouping. Identifying all the relevant languages is particularly troublesome 
for groups of languages with broad geographic distribution, such as Aus- 
tronesian or Amerind. 

If lexical cognates are the primary criterion for proof of genetic affinity, 
then exclusively shared innovations occupy a similar position with regard to 
subgrouping. Why are innovations so important? Let us not forget that in 
investigating genetic classification, we are attempting to understand the 
history and prehistory of linguistic groups. During stage one we adduce 
evidence that certain languages derive from a common ancestor. During 
stage two we form a hypothesis about how this common ancestor evolved 
into its currently extant (or historically attested) offspring. Genetic classifi- 
cation, if it is successful, is thus a reflection of history. Exclusively shared 
innovations are important in determining subgrouping because they attest 
to a period of common independent development. As a language evolves 
independently of related languages, it goes its own way, so to speak, inno- 
vating in its own individual way. Each such innovation, whether lexical, 
phonological, semantic, or syntactic, is an indication of the independent 
development of the language or group in question. 

Not all putatively shared innovations carry the same weight, however. 
Some changes (e.g. the voicing of intervocalic p, t, k) occur so frequently in 
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the world's languages that it is often difficult to be certain that such appar- 
ent innovations are not in reality separate but convergent developments. 
Accordingly, such innovations (representing known evolutionary paths of 
development) must be given less weight than truly idiosyncratic develop- 
ments, such as most lexical innovations. It is highly unlikely, for example, 
that two languages would independently come to associate the sounds tik 
with the meaning 'finger.' Since lexical innovations do not represent evolu- 
tionary paths (and thus the possibility of convergence is all but eliminated), 
they carry more weight. 

In theory, at least, the division of a group of languages into two parts 
may be established by the presence of a single exclusively shared innova- 
tion in one part but not in the other. In the real world, however, a single 
innovation is seldom convincing by itself, and in most instances a scholar 
will propose a set of shared innovations to justify a particular subgrouping. 
The greater the number of shared innovations, the more confident we may 
be that our division into two groups represents a true historical split. 

Obviously the longer that two languages (or groups of languages) have 
been separated, the more innovations will be found in each. Thus the diffi- 
culty of discovering the historically correct subgrouping is to a large extent 
a function of the length of time between various proposed splits. The situa- 
tion may be understood most easily by looking at a few simple diagrams. 
Consider the genetic subgrouping in Fig. 1.7. We define x as the length of 
time between Time A and Time B, and y as the length of time between Time 
B and Time C. The determining factor in whether one can discover (either 
easily or with difficulty) a subgrouping such as that in Fig. 1.7 is the ratio 
х/у. As this ratio increases, the subgrouping problem becomes progres- 


Time A A 
Xx 
1 В В, В, B, 


Time C C, C, С; С, C; CÇ, 


Fig. 1.7. A possible subgrouping of family А 
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sively simpler; as the ratio decreases, the subgrouping problem becomes 
more and more difficult, and eventually impossible. А case where х is large 
апа y is small is illustrated in Fig. 1.8. The ratio х/у = 3,500/1,500 = 2 %is 
relatively large, and as we saw in Table 1.1, the correct subgrouping is evi- 
dent even on inspection of only a few fundamental vocabulary items. 

Consider next a case at the opposite end of the scale, where x is small 
with respect to y, as in Fig. 1.9. In this (somewhat simplified) example, 
spoken Latin divided into three distinct groups following the collapse of 
the Roman Empire around А.р. 500. In this instance the x/y ratio is com- 
paratively small (500/1,000 — 12), and as Malkiel's (1978) historical survey 
clearly indicates, there has been no small number of classificatory disputes, 
and still no general consensus, on the internal relationships of the Ro- 
mance languages. 

Table 1.2 gives the equivalents of 14 words (the same nine words in Table 
1.1 plus five pronouns) in 19 Romance languages or dialects. As we can see, 
whereas there is abundant evidence for the genetic unity of the group as a 
whole (stage one evidence), there is a relative paucity of evidence for sub- 


| = 3,000 в.с. Indo-European 

x = 3,500 years 

| = A.D. 500 Romance Germanic Slavic 

v = 1,500 years 

С = a.p.1986 Rumanian Spanish English German Russian Polish 


Fig. 1.8. Differentiation of the Indo-European family (partial) 
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А = A.D. 500 Romance 
x = 500 years 
В = А.р. 1000 West Central East 


y = 1,000 years 


С = А.р. 1986 Italan Spanish Sardinian Daco-Rumanian Arumanian 


Fig. 1.9. Differentiation of the Romance family (partial) 


grouping these languages (stage two evidence). There is in fact no obvious 
subgrouping suggested by the lexical evidence in Table 1.2. 

A careful examination of Table 1.2 does, however, reveal a number of 
hints that the Rumanian dialects form a group of languages distinct from 
the rest of Romance: 

1. The use of [gura] ‘mouth’ (< Latin gula ‘throat’) in all the Rumanian 
dialects, as opposed to [buka] ‘mouth’ (or related cognates) in the other Ro- 
mance languages is indicative, as is the discovery that Rumanian [gura] has 
cognates in other Romance languages, cognates that have retained both the 
Latin meaning and the word-medial [I]. Furthermore, the Latin word bucca 
‘mouth’ has not disappeared from the Rumanian dialects, but rather has 
undergone a semantic shift: ‘mouth’ > ‘cheek’ > ‘buttock,’ and this un- 
usual semantic innovation is found in both Daco-Rumanian and Arumanian. 

2. In the pronoun ‘she,’ the Rumanian dialects share the phonological 
innovation | > j. The Romance languages generally preserve a lateral (either 
Ш or [A]) in this word. Only Sardinian, which has transformed the lateral 
into a retroflex stop [d], and New World Spanish, in which [A] has been 
reduced to [j] (the sound of ‘y’ in ‘yet’), violate this distribution. 

3. The Rumanian dialects have all raised the first vowel in dinte ‘tooth,’ 


Language 


Arumanian 


Megleno-Rumanian 


Daco-Rumanian 
Istro-Rumanian 
Dalmatian 
Sardinian (Log) 
Sardinian (Camp) 
Italian 

Friulian 
Romansch 
French 
Franco-Provencal 
Provençal 
Gascon 

Catalan 

Spanish (Spain) 
Spanish (Mexico) 
Galician 
Portuguese 


one 


un 
un 
un 
ur 
join 
unu 
unu 
uno 
un 


бе/ге 
уп 
un 
uno 


UNO 
un 


doj 
doj 
doj 
doj 
doi 
duos 
duus 
due 
doi 
duos 
do 
du/due 
dus 
dys 
dos 
dos 
dos 
dows 
dəjš 


Table 1.2, Comparative Vocabulary of the Romance Languages 


three 


trej 
trej 
trej 
trej 
tra 
tres 
tresi 
tre 
tre 
trais 
tRwa 
tra 
tres 
tres 
tres 
tres 
tres 
tres 
treš 


Word 

head 
kap 
kap 
kap 
kap 
kup 
kabu/testa 
kabu/testa 
kapo/testa 
čaf 
čo 
Sef/tet 
teta 
kap/testa 
kap 
kap 
kabeba 
kabesa 
kabe@a/testa 
kabesa 


eye 
okAju 
wokAu 
okj 

okAu 
vakl(o) 
oju/ogru 
ogu 

okjo 

voli 

сеќ 
сеј/је 

2у 

el 
oeÁ/oej 
UA 

OXO 

OXO 

040 

oÁu 


ear 


ureakAe 
ureakAi 
ureke 
urekÁe 
orakla 
orija 
origa 
orekjo 
oreAe 
uraÁa 
oRsj 
oreÁe 
awreÁo 
awreÁe 
urgÁo 
orexa 
orexa 
oreÁa 
ureAa 


nose 


"^a 
каль 


паз 
nas 
nas 
nuos 
nare 
nazu 
naso 
nas 
nas 
ne 
nas 
nas 
nas 
nas 
nari 
naris 
nari 
naris 


Language 


Arumanian 
Megleno-Rumanian 
Daco-Rumanian 
Istro-Rumanian 
Dalmatian 
Sardinian (Log) 
Sardinian (Camp) 
Italian 

Friulian 
Romansch 
French 
Franco-Provencal 
Provencal 
Gascon 

Catalan 

Spanish (Spain) 
Spanish (Mexico) 
Galician 
Portuguese 


mouth 


gurə 
gurə 
gurə 
gurə 


buka 
buka 
boka 
buce 
buoca 
bus/goel 
bo0ə 
boko 
buko 
boka 
boka 
boka 
boka 
boka 


ju 
dego/ego/eo 
deu 
io 
jo 
eu 
Zo 
бо 
jew 
zu 
Zo 
jO 
jo 
eu 
ew 


nagan 


er 


ЕЕЕ 


jelu 
jel” 
jel 
jel 
jal 
idu 
idu 
eAi 
el 
el 


il 
К1)/е(1) 
el 


et 
ел 
е! 
el 
el 
eli 


ela 
elo 
eÁo 
eÁa 
eja 
ela 
ela 


noj 

noj 

noj 

noj 

noi 

nos 

nos 

noi 

nus 

nus 

nu 

no 
nusawtres 
nuzaw 
nuzaltras 
nosotros 
nosotros 
nos 

nos 
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not ап unusual type of change, but опе that Table 1.2 indicates only for 
Friulian among the other Romance languages. (The phonological innova- 
tion of raising vowels before nasal consonants probably developed inde- 
pendently in Friulian and the Rumanian dialects.) 

4. The retention of the -КІ- cluster in the words for ‘eye’ and ‘ear’ іп most 
Rumanian dialects (Standard Rumanian = Daco-Rumanian is an exception 
here) is a trait shared only by Dalmatian in Table 1.2. Whether the retention 
(as opposed to innovation) of a trait has any evidentiary value for sub- 
grouping is a matter of scholarly disagreement. 

“Тһе few words examined in Table 1.2 can only hint at the distinctness of 
Eastern Romance (= the Rumanian dialects). An examination of all the evi- 
dence, however, reveals an overwhelming number of innovations separat- 
ing Eastern Romance from the rest of the Romance family. 

Problems of subgrouping will reappear in later chapters, and the ideas 
outlined here will be further refined. For the moment Greenberg's (1957: 53) 
summary of the problems inherent in subgrouping, and the methodology 
appropriate to it, will serve to close our preliminary discussion of genetic 
classification: 


There is hardly a feature shared by certain related languages and not others for 
which convergence or borrowing is absolutely excluded as an explanation. Never- 
theless, when in even more difficult cases the evidence is examined closely and in 
the light of general comparative reconstruction of the linguistic history of the entire 
family, certain groupings will normally emerge. Although . .. no single resem- 
blance is ever completely decisive, it will be found that certain languages share with 
one another a far larger number of features, which may be innovations, than they do 
with related languages outside the subgroup, and that among these are some of 
those which are least likely to be the result of convergence, including shared sound 
shifts en masse, sporadic sound changes, new morphological formations, shared 
analogical shifts, including some of the rarer ones, and true lexical innovations. It is 
the sheer number of such resemblances, together with the inclusion of some of the 
types most likely to be innovations, that excludes convergence or borrowing as an 
over-all explanation. 


1.5 TAXON NAMES 


Just as biologists classify plants and animals in terms of taxa such as spe- 
cies, genus, family, order, class, and phylum, linguists use a similar termi- 
nology in classifying languages and groups of languages: dialect, language, 
family, stock, phylum. In the linguistic literature and in this volume, a taxon 
is any genetic node, as defined in definition 2 of section 1.1. It may be a 
“low-level” group such as Romance, Slavic, or Bantu, or a “high-level” unit 
such as Nilo-Saharan, Indo-Pacific, or Amerind. The common ancestor 
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from which а low-level group of languages дегіуев existed at а relatively 
recent date, whereas the common ancestor of a high-level group is consid- 
erably more ancient. In order to give a more precise indication of the level, 
or rank, of a taxon in a hierarchic classification, lnguists, like biologists, 
assign the various taxa to different categories. Running from low-level (ге- 
cent) to high-level (ancient), the linguistic terms commonly used are dialect, 
language, family, stock, and phylum. If finer gradations are needed, one may 
use the prefixes sub- and macro-: a subfamily is a branch of a family; a mac- 
rophylum is a group of phyla. 

Linguists are, however, less consistent in their use of categorical names 
than are biologists; certain terms, especially family and phylum, are fre- 
quently used ambiguously. Thus, in addition to its proper categorical mean- 
ing (a group of related languages), family is often used synonymously with 
the generic terms "group," " node," and "unit"; that is, a family can be any 
genetic group at any level. Phylum, on the other hand, is often used to des- 
ignate any genetic group, whether high-level or low-level, that is not known 
to be related to any other group. Thus the Chukchi-Kamchatkan and 
Eskimo-Aleut families are sometimes called phyla because they do not ex- 
hibit any evident relationship to any other group, even though they are 
clearly not ancient groupings of the Australian or Amerind sort. 

The language/dialect distinction in linguistics parallels the species/variety 
distinction in biology. Whereas a species is usually thought of as a group of 
plants or animals that can interbreed and produce viable progeny, a lan- 
guage is a group of dialects among which there is mutual intelligibility.* As 
Cavalli-Sforza (1984) points out, in both instances we are dealing with com- 
munication, genetic in regard to species, linguistic in regard to language. 
But the analogy is not perfect, since the criterion of mutual intelligibility is 
less well defined than that of interbreeding. Mutual intelligibility ranges 
from close to 0 percent to close to 100 percent; interbreeding is usually an 
all-or-nothing proposition. As was noted in section 1.2, there are many 
gray areas in deciding what to call a language and what to call a dialect. 
Biological speciation is much more clear-cut. Yet the similarities, to which 
Darwin called attention in 1871, are indeed striking. 

Pidgins and creoles are special kinds of languages. Pidgins are simplified 
languages that sometimes arise when people speaking distinct languages 
need to communicate. The simplified language is an amalgam of two (or 
more) languages, but generally one language is dominant. Another defin- 
ing characteristic of pidgin languages is that no one speaks them as a native 


“In linguistics "dialect" does not have the pejorative connotation that it often does in every- 
day parlance. It is simply a variety of speech that is similar to, but different from, other vari- 
eties of speech. The origin of the differences is usually the geographic or social separation of 
different groups. 
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(i.e. first) language; that is, they аге spoken only Бу people who usually 
speak another language. Under certain circumstances a pidgin may become 
“creolized.” Creoles differ from pidgins, initially at least, in only опе ге- 
spect: they are the first language (and sometimes the only language) of a 
group of people. Creole languages gradually lose the simplicity resulting 
from their unique origin as they evolve according to principles governing 
all human languages. 

In this volume the various taxon names are used as they are throughout 
the linguistic literature. In particular, family and phylum are used with both 
of the meanings described above. Group, node, and unit are used inter- 
changeably for genetic units at any level. Stock is used only for groups that 
are intermediate between families and phyla. 
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Europe 


All the languages currently spoken on the European continent, save Basque, 
belong to one of three language families: Indo-European, Uralic, and Cau- 
casian. Basque, spoken in northern Spain by close to a million people, and 
elsewhere in the world by perhaps another million Basque emigrants, is a 
language isolate; that is, it has no obvious relationship with any other lan- 
guage. Some scholars believe it is distantly related to the Caucasian family, 
but this conjecture is hotly disputed by others. 

The Indo-European family is the most widespread, covering most of Eu- 
rope and extending through Asia Minor (Iran, Afghanistan, and Pakistan) 
into India. Emigrants, since 1500, have also carried Indo-European lan- 
guages to the other continents, where they have often become the domi- 
nant languages (e.g. English in Australia and North America, Spanish and 
Portuguese in South America, Russian in northern Asia). The Uralic family, 
which includes Finnish and Hungarian, is found chiefly in northern Eu- 
rope, but extends beyond the Urals into northwest Asia. Hungarian, how- 
ever, is a Uralic island in central Europe, surrounded on all sides by Indo- 
European tongues. The Caucasian languages belong to two very divergent 
subfamilies, which possibly do not together constitute a valid genetic node. 
They are spoken in the mountains of the Caucasus, straddling south- 
east Europe and Asia Minor. Georgian is the only well-known Caucasian 
language. 

This chapter traces the discovery and elaboration of the idea that lan- 
guages may exhibit similarities because they are descendants of a common 
ancestor. After placing the question of language classification in the broader 
historical context of evolutionary studies (section 2.1), the chapter then 
surveys the period up to 1800 (section 2.2). Section 2.3 discusses the birth 
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апа development of comparative linguistics ав it was first elaborated with 
respect to the Indo-European language family in the nineteenth century, 
and sections 2.4 and 2.5 treat the Uralic and Caucasian families, respec- 
tively. Subseguent chapters trace the development of genetic classification 
in other language families. 


2.1 THE HISTORICAL CONTEXT 


Today it does not seem startling that a single language, through the mi- 
gration of a group of its speakers and their subsequent isolation, should іп 
the course of time develop into two distinct languages, as the speech of the 
emigrants and that of the original group gradually drift apart. But this idea 
did not become current, and enjoy consensus, until the first half of the 
nineteenth century, though it had numerous precursors. 

The “genetic hypothesis,” considered by Hockett (1965: 185) to be the 
first major breakthrough in the field of linguistics, must be viewed in a 
broader historical context to be fully appreciated. For the nineteenth cen- 
tury witnessed a fundamental transformation of man's perception of him- 
self, and his place in nature, that was perhaps as significant as the Coper- 
nican Revolution some 300 years earlier. In a number of disparate fields, 
including linguistics, geology, paleontology, and biology, a static, theologi- 
cally inspired view of nature was replaced by a dynamic scientific ар- 
proach. The most celebrated breakthrough was of course Charles Darwin's 
discovery (independently arrived at by Alfred Russel Wallace) that species 
may evolve gradually into other species through the operation of forces 
such as natural selection. Furthermore, Darwin, and linguists of the day 
like August Schleicher, did not fail to note the similarities between their re- 
spective disciplines. As Darwin remarked in one of his later works (1871: 
40), “The formation of different languages and of distinct species, and the 
proofs that both have been developed through a gradual process, are curi- 
ously parallel.” 

To appreciate the extent of this intellectual transformation, let us for a 
moment try to view the natural world as an educated person might have in 
the first decade of the nineteenth century. In general, our outlook would 
have been constrained, if not dominated, by religious dogma in the form of 
“Creationism,” the idea that the world and all its component parts (e.g. 
plants, animals, minerals) came into being by a single act of divine creation. 
Seventeenth-century theologians had even determined the precise date of 
the earth creation: 9:00 A.M., Sunday, October 23, 4004 s.c. Similarities 
between certain animals (e.g. dog-wolf, cat-tiger, man-ape), and between 
certain plants, had of course been noticed, and even systematically classi- 
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fied by people such as Linnaeus. But they had not really been explained, 
except perhaps in terms of "divine order." 

The absence of fossils in certain strata of the earth's crust was explained 
at the start of the nineteenth century by a theory called Catastrophism, ac- 
cording to which all life had periodically been wiped out by calamities such 
as floods and earthquakes. Following each of these disasters, new species 
of animals and plants were created by new acts of creation. The fact that 
plants and animals in later strata often appeared more advanced than simi- 
lar forms in earlier strata was explained by the theory of Progressionism, 
which held that the Creator had made progressively more advanced forms 
after each disaster. 

One of the dominant linguistic themes in the year 1800, as it had been for 
the preceding century, concerned the original language of mankind. The 
Old Testament reveals that all mankind originally shared a single uniform 
language, which God had felt obligated to confound at the Tower of Babel, 
lest the human race become too powerful. But the Bible does not specifi- 
cally identify the original language, and the eighteenth century had wit- 
nessed unbridled speculation on this topic. The most popular candidate 
had for centuries been Hebrew, no doubt because most of the Old Testa- 
ment was written in this language. From time to time other languages were 
also advocated; the Dutchman Goropius Becanus, for example, in 1569 ar- 
gued patriotically for Dutch as mankind's original tongue. 

To some extent this traditional world view had already begun to be un- 
dermined during the eighteenth century, and precursors of nineteenth- 
century developments may be cited in all the fields mentioned above. In 
biology, in particular, people such as Montesquieu, Pierre-Louis de Mauper- 
tuis, Denis Diderot, Georges Buffon, and Carolus Linnaeus had already, in 
the eighteenth century, questioned the idea of the immutability of species. 
Darwin's grandfather, Erasmus Darwin, was an outspoken advocate of evo- 
lution (or transmutation, as it was then known), and Lamarck explicitly 
proposed a theory of evolution in the first decade of the nineteenth century 
and even drew up a genealogical tree of the animal kingdom in 1809. 
Lamarck's explanations of the mechanisms that cause evolution, the most 
notorious of which was the heritability of acquired characteristics, are now 
generally considered incorrect, but his recognition of the existence of evo- 
lution has been amply confirmed. Although the idea of evolution was intu- 
itively grasped by men like Lamarck, today we find the various explana- 
tions offered to account for it fanciful and erroneous. The key to evolution, 
natural selection, was to be Charles Darwin's contribution, but in 1800 that 
event lay 60 years in the future. 

In the field of geology, James Hutton had proposed, toward the end of 
the eighteenth century, that the earth was in fact much older than most 
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contemporaries conceded, and furthermore that the earth’s surface had not 
been formed abruptly by a series of catastrophes, but slowly, by the gradual 
erosion that results from natural forces such as wind, rain, water, and 
ice. Hutton's theory, which came to be known as Uniformitarianism, was 
thought by many to be irreligious, and did not at first make appreciable 
inroads on the Biblically inspired doctrine of Catastrophism. 

Hutton himself died in obscurity in 1797. By coincidence Charles Lyell 
was born that same year, and it was Lyell who would in the 1830's resurrect 
and prove the theory of Uniformitarianism. Lyell’s Principles of Geology 
(1830), in addition to vindicating Hutton's earlier ideas, also had a signifi- 
cant influence on Darwin's thought. Lyell himself, though a close friend of 
Darwin's, was not initially a supporter of evolution. He withheld his assent 
until the publication of the Origin of Species (1859), at which time, however, 
he threw his considerable intellectual weight squarely behind Darwin's 
theory. 

In the realm of human language, the eighteenth century had already 
seen opposition to the idea that Hebrew was mankind's original tongue. 
One of the most vocal opponents was the German mathematician G. W. 
Leibniz, who related Hebrew to Arabic, in the group where it rightly be- 
longs. Toward the end of the century, attention began to shift from the iden- 
tification of man’s original language (on which no real progress had been 
made in spite of an enormous amount of effort) to the study of the Sanskrit 
language in far-off India. News of Sanskrit had for some time been filtering 
back to Europe, and it had already been noticed to exhibit striking simi- 
larities with European languages. In 1786 the orientalist William Jones, at a 
gathering of the Royal Asiatic Society in Calcutta, expressed the opinion 
that Sanskrit, Greek, and Latin, and probably also Gothic, Celtic, and Per- 
sian, "have sprung from some common source, which, perhaps, no longer 
exists" (quoted in Robins 1967: 134). This idea was to dominate linguistic 
research in the nineteenth century, and in the process the question of an 
original human language gradually faded from view—and in some quar- 
ters was even proscribed from scientific discussion. 

By the end of the nineteenth century, an evolutionary approach involv- 
ing gradual change over long periods of time had become dominant in geol- 
ogy, biology, paleontology, and linguistics. In geology the forces that had 
shaped the earth's surface were seen to be operating just as they had in the 
past. Moreover, the proposed age of the earth had grown considerably 
from the original church estimate of some 6,000 years. Uniformitarianism 
had replaced Catastrophism. 

In biology the theory of evolution by natural selection won its battle with 
the Creationists after the appearance of the Origin of Species in 1859, though 
skirmishes between Creationists and Evolutionists still recur from time to 
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time іп the twentieth century, particularly іп the United States. After Dar- 
win it was understood that species are not immutable; rather, the species 
that now ezist are simply the descendants of other species that once inhab- 
ited the earth. The recognition of а historical connection between all forms 
of life, and of the gradual evolution through time of more and more com- 
plex organisms, culminating in the relatively recent appearance on the 
earth of humans, contrasts sharply with the static picture of discrete and 
unchanging categories, all present on the earth from the beginning of time, 
that had been advocated by Creationists at the start of the century. In lin- 
guistics scholars had by the end of the nineteenth century developed meth- 
ods for studying the genetic relationships of languages and even for recon- 
structing portions of languages (such as Indo-European) that had existed 
thousands of years earlier but that had left no written trace, their daughter 
languages (some extinct) being the only available evidence. 

By 1900 educated persons, won over by the nineteenth-century suc- 
cesses in biology, geology, linguistics, and other fields, turned in progres- 
sively larger numbers to science for an understanding of the natural world. 
The role of the church as the ultimate arbiter in scientific disputes, long on 
the decline, more or less evaporated in the nineteenth century, and twen- 
tieth-century inquirers have looked increasingly to science for their final 
answers. 
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2.2 THE LINGUISTIC BACKGROUND 


It is difficult to determine who, and in what year, first realized that simi- 
larities between languages may (sometimes) be explained as traits inherited 
from a common ancestor. What is clear is that this notion took hold at 
the start of the nineteenth century and provoked a period of intense re- 
search on the Indo-European family and general comparative linguistics. 
The event traditionally identified as the starting point for this linguistic 
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revolution is William Jones's address to the Royal Asiatic Society іп 1786, 
but all the standard handbooks hasten to add that Jones was in fact nei- 
ther the first to recognize similarities between Sanskrit and European lan- 
guages nor the first to propose а genetic explanation for such resemblances. 
What he did was to state clearly, іп а prominent forum, an idea whose time 
had come. 

Іп delving into the historical background of the genetic hypothesis, there 
are two distinct guestions we should not confuse. The first is when simi- 
larities between certain languages were first noticed; the second is when 
those resemblances were first erplained as being due to historical descent 
from a common ancestor. With regard to the first guestion, we may safely 
assume that people have recognized similarities between their own speech 
and that of other fairly similar languages ever since human language devel- 
oped on the earth, and probably recognized similarities even during the 
developmental stages leading up to human language. It is today impossible 
for, say, any Rumanian speaker to have modest contact with Italian, or even 
Spanish, without recognizing immediately that these two geographically 
distant languages resemble Rumanian in a way that neighboring Hungarian 
and Bulgarian do not. Wherever in the world languages are fairly similar, it 
seems likely that from time to time such similarities are, and always have 
been, duly noted. 

Explaining similarities between languages is more difficult than simply 
noticing them, and consequently scholars did not generally reach the ge- 
netic hypothesis until the nineteenth century. Exactly when, and with re- 
gard to what group of languages, was the genetic hypothesis first formu- 
lated? It is difficult to find a precise answer in the literature. However, the 
historical record from Classical times to 1800, which is examined in the re- 
mainder of this section, does reveal a number of significant events. 

Already in the first century B.c. the Romans were well aware of numer- 
ous resemblances between Greek, a language for which they had great re- 
spect, and their own language, Latin. These similarities were attributed to 
the Greek origin of the Latin-speaking people, one version of which is 
found in Book I of the Aeneid. Thus the Romans, in pre-Christian times, 
both recognized similarities between their language and another and gave a 
genetic (i.e. historical) explanation for them. This particular genetic expla- 
nation was, however, incorrect, because Latin and Greek both derive from 
Indo-European, rather than one from the other. 

In early Christian times the monogenesis of all languages was widely ac- 
cepted, on the strength of the Biblical account in Genesis of the origin of 
language diversity at the Tower of Babel (see p. 26). The assumption of He- 
brew as mankind's original language was probably unfortunate for the de- 
velopment of genetic linguistics, for it meant that medieval and Renais- 
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sance scholars were facing ап impossibility in trying to derive their own 
European languages from an unrelated Semitic language. To perform this 
Herculean feat scholars were forced, in relating Hebrew words to supposed 
European cognates, to take such phonetic and semantic liberties that the 
entire field of etymology sank into disrepute, from which it did not fully 
recover until the nineteenth century. Voltaire's observation that "etymology 
is a science in which the vowels count for nothing and the consonants for 
very little" is indicative of the low level of credibility that etymology had 
come to enjoy. 

Around the twelfth century the recognition of similarities between lan- 
guages, followed by a genetic hypothesis to explain them, began to appear 
in the written record. In Iceland an unknown scholar, in what has become 
known as the First Grammatical Treatise, concluded that Icelandic and En- 
glish were related because they shared many similar words. Toward the end 
of the twelfth century, Giraldus Cambrensis asserted that Welsh, Cornish, 
and Breton were descendants of a single earlier language, which he called 
the "antiquum linguae Britannicae idioma." Today these languages are clas- 
sified as the Brythonic branch of the Celtic family. Cambrensis went on to 
suggest a relationship between this "ancient Brittanic tongue" and Latin 
and Greek, pointing out lexical resemblances such as Latin sal, Greek hals, 
and Welsh halen, all of which mean ‘salt.’ 

Probably no group of languages offered better prospects for the discov- 
ery of genetic linguistics than the Romance family. Here was a group of lan- 
guages whose resemblances were obvious at a glance (see Table 1.2 above), 
and whose parent language, Latin, not only was abundantly attested in 
written records, but was even, throughout the Middle Ages, the spoken 
language of the church and the academic world. Yet despite these pro- 
pitious circumstances (or perhaps even because of them), the Romance 
family did not play as large a role in unlocking the secrets of genetic classifi- 
cation as it might have. In light of the close-knit nature of the Romance 
group, it seems inescapable that resemblances between Italian, Provencal, 
French, and Spanish must have been noticed regularly throughout the 
Middle Ages, even if such observations were not written down. In the early 
fourteenth century the Italian poet Dante was fully aware of the lexical re- 
semblances between Romance languages. He also knew that languages 
change and may break up into dialects. But he nonetheless never realized 
that the Romance languages were simply the historical continuation of the 
Latin spoken by the Romans more than 1,000 years earlier. 

Apparently the genetic hypothesis was initially proposed for the Ro- 
mance languages in the first half of the fifteenth century, although most of 
the standard Romance handbooks do not discuss the history of the genetic 
hypothesis with regard to Romance. A notable exception is Hall (1974: 231), 
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who gives the following account: “А few fifteenth-century scholars, e.g. 
Flavio Biondo (1388—1463), had considered the modern idiom [i.e. Italian] 
to be the direct continuation, unchanged, of popular Latin; but others, be- 
ginning with Poggio Bracciolini (1380-1459) and Leonardo Bruni (1370- 
1444), saw that the modern Romance tongues, including Roumanian, were 
the result of linguistic change as it had affected Latin." Yet this early recog- 
nition of the genetic hypothesis did not instigate research on other lan- 
guage groups, but remained rather an isolated event, a dormant seed that 
would not begin to sprout for several more centuries. 

In retrospect we can perhaps understand why the Romance example did 
not kindle success elsewhere, for the Romance group is, or at least was, 
almost unique in preserving the common ancestor (Latin) side by side with 
its daughter languages (Italian, French, Spanish, etc.). In the Middle Ages 
the anomalous, and artificial, nature of the preservation of Classical Latin 
was not clearly recognized. Nor was it realized that the ancestor of a group 
of languages may itself no longer exist, except in the form of its descendant 
languages. In fact the Romance case, which is probably the first widely rec- 
ognized example of the genetic hypothesis, actually encouraged the search 
for linguistic ancestors among the world's living languages. If the special 
circumstances surrounding Latin contributed to the early recognition of 
the genetic hypothesis with respect to Romance, they probably obscured 
its implications for other language families. 

Grammars of languages other than Latin and Classical Greek first began 
to appear at the end of the fifteenth century. During the following centuries 
grammars of languages from the most distant parts of the earth were pub- 
lished with greater and greater frequency. These grammars were written for 
a variety of purposes by a disparate group of authors that included mission- 
aries, diplomats, explorers, and even an occasional layman inspired by 
nothing more than personal curiosity. They frequently suffer from a some- 
what slavish adherence to Latin grammar, but since this was the only de- 
scriptive model widely available to Europeans at the time, a certain forced 
imitation was perhaps unavoidable in the beginning. Furthermore, a preoc- 
cupation with "universal grammatical categories," a legacy of the Greeks, 
made it seem natural to seek to identify those grammatical categories as- 
sumed to be universal, in particular grammars of individual languages. De- 
spite their obvious flaws, these early grammars were of great importance 
for two reasons. First, they revealed beyond question that European lan- 
guages possessed grammatical categories that were more often than not ab- 
sent in other languages (e.g. gender), while at the same time grammatical 
categories widespread outside Europe were sometimes totally lacking in 
European languages (e.g. the distinction between inclusive and exclusive 
‘we,’ first described in a Spanish grammar of Aymara in 1603). 
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Тһе second important impact of the gradually accumulating linguistic 
documentation was that it led inevitably to a comparison of one language 
with another. As more and more languages became known, attempts were 
made to survey the world's languages. The first such general survey, the 
Mithridates of the Swiss scholar Conrad Gesner, appeared in 1555. That 
work (named after a king іп Asia Minor in the first century в.с. who was 
reputed to know 25 languages) gave comparative wordlists for a number of 
Janguages. It derived Italian, Spanish, and French from Latin, and all lan- 
guages from Hebrew. 

Other general surveys followed at irregular intervals, often with a speci- 
men translation of the Lord's Prayer іп each language. Іп 1592 the German 
Hieronymous Megiser published Specimen KL linguarum et dialectorum 
(Specimens of Forty Languages and Dialects); a second edition in 1603 in- 
creased the number to 50. 

One of the more ambitious early attempts at collecting wordlists, and 
thereby classifying languages into groups, was carried out by Catherine the 
Great of Russia toward the end of the eighteenth century. Catherine com- 
missioned P. 5. Pallas to collect and organize wordlists from a broad sample 
of the world's languages, and it is usually Pallas's name that is associated 
with the result of this project, which was published from 1786 to 1789 and 
contained information on 200 languages. However, according to Key (1977), 
it was really Catherine herself who decided the ordering of the languages 
in this work, and that order was to a very large extent genetic. Slavic lan- 
guages came first, followed by Celtic, Greek, Romance, Germanic, Baltic, 
Caucasian, Finno-Ugric, Iranian, Semitic, Turkic, and so on. There were of 
course a few errors (e.g. Basque was included in Celtic, and Yiddish was 
placed in Semitic), and in the second half of the list the ordering becomes 
more geographic than genetic, with Hawaiian the last language listed. А 
second edition of this work (1791) added 80 more languages, including for 
the first time African and Amerindian tongues. Catherine's success in sort- 
ing out so many valid genetic groups is less surprising than one might 
think. In addition to having collected the requisite data for genetic classifi- 
cation (wordlists of 285 words each), Catherine was herself trilingual in 
German, Russian, and French, each of which is a member of a different 
branch of Indo-European. Ironically, although Catherine's classification of 
the world's languages began to appear the same year that William Jones de- 
livered his famous pronouncement on Indo-European (1786), Catherine's 
contributions to genetic linguistics have today been all but forgotten, while 
Jones's discourse has assumed a quasi-mythical status. 

During the first decade of the nineteenth century, a Spanish Jesuit, 
Lorenzo Hervás y Panduro, published a six-volume compendium contain- 
ing material on 300 languages. The last Mithridates was published by J. C. 
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Adelung from 1806 to 1817 іп four volumes, апа contained, among other 
things, the Lord's Prayer in 500 languages. In that work Adelung recognized 
the affinity of Sanskrit with European languages, a topic then on the verge 
of replacing interest in comparative wordlists with attention to the recently 
established Indo-European (І-Е) family. 

With this gradual accumulation of linguistic documentation, it was inevi- 
table that from time to time people would attempt to classify the known 
languages into groups, usually on the basis of lexical similarities. One of 
the more successful early attempts, by J. J. Scaliger, appeared in 1610. In 
that work Scaliger divided European languages into 11 groups on the basis 
of the word for 'God' in each (see List 2.1). Significantly, all 11 of these 
groups are correct genetic units, though some of the groups are now 
known to contain additional languages of which Scaliger was unaware. 
Scaliger did not recognize, and in fact explicitly denied, any relationship 
between these 11 groups, thus failing to recognize that groups 1, 2, 3, 4, 5, 
9, and 10 belong to the I-E family (within which 9 and 10 constitute the 
Celtic subfamily), and 7 and 8 are members of what is today called the 
Finno-Ugric family. 


List 2.1. European Language Families (Scaliger 1610) 


1. SLAVIC 6. TARTAR 9. Irish and 

2. GERMANIC 7. Hungarian Scottish Gaelic 
3. ROMANCE 8. Finnish (and 10. Welsh and Breton 
4. Greek Saami — Lapp) 11. Basque 

5 


. Albanian 





Scaliger's classification is unusual in the broad range of languages taken 
into consideration. More frequently the recognition of similarities between 
languages was more restricted, often being limited to just two languages. 
For example, in 1615 the Lithuanian Michalon claimed a relationship be- 
tween Lithuanian and Latin, though he rejected one between the even 
more closely related Lithuanian and Russian. In 1671 the Swedish scholar 
G. Stiernhielm compared the inflectional endings of Latin habere 'to have' 
and Gothic haban ‘to have’ (the stems of which, hab-, are not cognate de- 
spite appearances), and concluded that both languages must be descen- 
dants of a common ancestor. In 1686 the Swedish scholar А. Jager, in a pub- 
lic lecture, pictured an ancient language spreading over Europe, breaking 
up into daughter languages, which in turn led to Persian, Greek, Romance, 
Slavic, Celtic, Gothic, and Germanic languages. Also recognized during the 
seventeenth century was the Baltic group (Lithuanian, Latvian, and Old 
Prussian), as well as the close affinity between this group and Slavic. 
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Тһе eighteenth century witnessed additional discoveries of genetic links 
between various languages, including for the first time groups outside Eu- 
rope. In 1702 Hiob Ludolf compared all known Semitic languages, and in 
1707 E. Lhuyd founded comparative Celtic grammar by joining Scaliger's 
groups 9 and 10 to form the Celtic family. In 1730 the Swede Phillip Johann 
von Strahlenberg published a work in which he employed the mass com- 
parison of basic vocabulary to classify the languages of Russia. According 
to Jakobson (1942: 603), Strahlenberg “established with an impressive ex- 
actitude nearly all the families of the non-Indo-European languages of Rus- 
sia.” In 1767 P. Coeurdoux asked the French Academy for an explanation of 
similarities between Sanskrit and Greek and Latin, similarities that he be- 
lieved to be relics of mankind's original language. Іп 1786 William Jones 
gave his famous address and attributed the similarities to common origin, 
though he did not imply they went all the way back to the Tower of Babel. 

In 1770 J. Sajnovics connected Hungarian with Lapp, and in 1799 Samuel 
Gyarmathi established the affinity of Hungarian and Finnish. The genetic 
group to which Finnish, Lapp, and Hungarian belong is today called 
Finno-Ugric. 

In 1778 William Marsden recognized striking similarities between lan- 
guages on the east and west coasts of Africa; today these languages are con- 
sidered members of the Bantu group. Three years later Marsden pointed 
out resemblances between the language of Madagascar (i.e. Malagasy) and 
the Polynesian languages, all of which belong to the Austronesian family. 
On this point, however, Marsden had already been foreshadowed by 
Hadrian Reland, who apparently had made a similar observation in 1706, 
and Frederick de Houtman, who had perceived the affinity between Malay 
and Malagasy as early as 1603. In the New World the Iroquoian, Algon- 
quian, Carib, and Arawakan families were all identified before 1800. 

We see then that by 1800 a not insignificant number of genetic groups 
had already been identified, but in most cases the true geographic extent of 
these groups was not yet known. For certain groups (e.g. Romance, Se- 
mitic, Finno-Ugric) the genetic hypothesis was already the accepted expla- 
nation for similarities, though for other groups, including І-Е, the genetic 
hypothesis had been advanced without necessarily gaining wide accep- 
tance. In the nineteenth century genetic classification left the realm of spo- 
radic observations by isolated scholars and became for the first time the 
center of focus of intensive academic research. Although there was in fact 
great progress in classifying languages from all parts of the world in the 
nineteenth century, it was the І-Е family that dominated attention and es- 
tablished the base for the development of general comparative linguistics. 
We shall explore that development іп the following section. 
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2.3 INDO-HITTITE 


Although the Indo-Hittite (or Indo-European) family contains only about 
140 of the world's 5,000 languages, its two billion speakers constitute almost 
half of the world's population. Since about 1500 А.р. Indo-European lan- 
guages have been spread around the world by the European colonial move- 
ment to an extent that obscures their original provenance. Map 2.1 shows 
the geographic distribution of the Indo-Hittite family around 1500, though 
both the Anatolian and Tocharian branches (already extinct) are also shown. 

The 10 branches that make up this family vary greatly in number of 
languages and number of speakers. Two of the branches, Anatolian and 
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Map 2.1. The Indo-Hittite family. Turkish is now spoken where the Anatolian languages 
were spoken; several Turkic languages have supplanted Tocharian. 


Tocharian, аге extinct. Several Anatolian languages were formerly spoken іп 
present-day Turkey; Tocharian languages were found in western China. Ar- 
menian and Albanian, though single languages, are independent branches 
of Indo-Hittite. Armenian has today about five million speakers, mostly іп 
the U.S.S.R., though there are substantial numbers of Armenian speakers 
in many countries around the world. The original homeland of the Arme- 
nians was in what is today eastern Turkey. But a massacre by the Turks at 
the beginning of this century decimated the Armenian population in that 
region and scattered the survivors throughout the world. Albanian is spoken 
in Albania and surrounding parts of Yugoslavia and Greece by roughly 
four million people; it consists of two very divergent dialects (Gheg and 
Tosk), which are not mutually intelligible. Standard Albanian is based on 
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the Tosk dialect. Greek is spoken in Greece by 10 million people. In addition 
to standard Greek, Tsakonian, with perhaps 10,000 speakers on the east 
coast of the Peloponnesus, continues the dialect of ancient Sparta. 

Four Celtic languages in Ireland, in Great Britain, and on the north coast 
of France are all that is left of a family that once extended through Europe 
into Asia Minor. Breton, spoken in northern France by more than a million 
people, is numerically the strongest, but its use is discouraged by the 
French government. The use of Irish (= Gaelic) is encouraged by the Irish 
government, but the estimate of a half-million speakers is probably over- 
optimistic. Welsh, spoken in Wales, has an estimated 750,000 speakers. 
There are very few monolingual speakers of any Celtic language; those 
who speak Breton are mostly bilingual in French, those who speak the 
others are bilingual in English. 

Originally a number of Italic languages were spoken in what is today 
Italy; all but Latin eventually became extinct. The descendants of Latin, 
known collectively as the Romance family, consist of some 16 languages 
with around 500 million speakers. The numerically strongest Romance lan- 
guages (with millions of speakers in parentheses) are Spanish (200), Por- 
tuguese (130), French (66), Italian (60), Rumanian (22), Provengal (9), Cata- 
lan (6), and Sardinian (1). Spanish is spoken not only in Spain, but also in 
most of Central and South America. Portuguese is spoken in Brazil as well 
as Portugal. Various Provengal dialects are spoken in the south of France, 
but they have been steadily eroded by French, and the estimate of nine mil- 
lion speakers is probably exaggerated; almost all Provengal speakers are by 
now bilingual in French. Catalan is spoken primarily in eastern Spain, 
where it remains strong despite the efforts of the Franco regime to restrict 
its use. 

About a dozen Germanic languages are spoken by some 450 million 
people. English, with about 300 million first-language speakers (and mil- 
lions more who speak it as a second language) is numerically the strongest. 
Though English was originally restricted to Great Britain, the colonial 
movement has resulted in its being dominant today in Ireland, the United 
States, Canada, Australia, and New Zealand; it is also widely used in other 
parts of the world (e.g. India, South Africa). Other important Germanic 
languages (with millions of speakers in parentheses) are German (100), 
Dutch-Flemish (20), Swedish (9), Danish (5), Norwegian (5), Yiddish (4), 
Afrikaans (4), and Icelandic (.25). Yiddish has developed from German, 
and Afrikaans from Dutch. Despite their separate national status, Swedish, 
Danish, and Norwegian are mutually intelligible. 

The Balto-Slavic family consists of 15 languages, with a total of some 290 
million speakers, and has two clearly defined branches, Baltic and Slavic. 
Two Baltic languages survive today; Lithuanian has three million speakers, 
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апа Latvian, two million. Both аге spoken along the western coast of the 
U.S.S.R., just north of Poland. The Slavic family contains roughly a dozen 
languages and 285 million speakers. Russian dominates, with 150 million 
first-language speakers апа millions more who use it аз а second language. 
Other Slavic languages, with millions of speakers in parentheses, are 
Ukrainian (40), Polish (35), Serbo-Croatian (17), Czech-Slovak (14), Byelo- 
russian (10), Bulgarian (8), Slovene (2), and Macedonian (1). Ukrainian and 
Byelorussian (-- White Russian) are spoken to the south and west, respec- 
tively, of Russian in the U.S.S.R. Serbo-Croatian is the national language of 
Yugoslavia, but Slovene is spoken in the north of that country, and Mac- 
edonian in the south. 

The largest subfamily of the Indo-Hittite family is Indo-Iranian, with 
about 90 languages and 680 million speakers. Indo-Iranian languages are 
spoken in Iran, Afghanistan, Pakistan, India, Nepal, Bangladesh, and Sri 
Lanka. In addition, Romany, the language of the gypsies, is spoken іп а 
number of mutually unintelligible dialects throughout western Europe and 
to a lesser degree in countries around the world. No fewer than 25 Indo- 
Iranian languages have a million speakers or more (the figures in paren- 
theses below indicate number of speakers, in millions). In Iran there is Farsi 
(= Persian, 20), which extends into Afghanistan; Mazanderani (2); and 
Kurdish (5), also found іп Iraq and Turkey. About 13 million Pashto speak- 
ers are found in Afghanistan and Pakistan. The most widespread Indic lan- 
guage is Hindi-Urdu, with 200 million first-language speakers and millions 
more second-language speakers; it is spoken in northern India and Pakistan. 
Sindhi (7) is also divided between India and Pakistan; Lahnda (15) and Ba- 
luchi (2) are found primarily in Pakistan. Nepali (10) is spoken in Nepal, 
and Sinhalese (10) on the island of Sri Lanka. Bengali (140) is spoken in 
Bangladesh and northeastern India. In addition to Hindi-Urdu, Bengali, 
and Sindhi, which are also spoken in neighboring countries, the follow- 
ing languages are located primarily in India: Awadhi (55), Marathi (45), Pun- 
jabi (40), Bihari (30), Gujarati (30), Oriya (20), Marwari (13), Kashmiri (3), 
Bhili (3), Pahari (2), Banjari (2), Garhwali (1), and Kumauni (1). 

We begin our discussion of genetic classification with the Indo-Hittite 
family not because it is the most widespread in the world, but because it is 
in terms of this family that the principles of genetic classification were, in 
the nineteenth century, first established. 


2.3.1 History of Indo-Hittite Classification 


When William Jones delivered his famous address in 1786, several lan- 
guage families (e.g. Romance, Germanic, Slavic, Baltic, Celtic) had already 
been recognized for some time, and occasionally, as we saw in the preced- 
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ing section, similarities between these families had been noticed. More 
often than not such comparisons were binary (i.e. only two families were 
taken into consideration), and the potential relevance of other languages or 
language groups was disregarded. This binary approach was characteristic 
of many prescientific classificatory attempts. А person speaking one lan- 
guage would by historical accident find himself in a locality where the 
people spoke a language with similarities to his own. Or a scholar inter- 
ested in one group of languages might casually notice resemblances with 
another family. But there was seldom any attempt to collect, organize, and 
evaluate all of the evidence; research was directed more Бу happenstance 
than by design. It is to Jones's credit that he included six distinct groups іп 
his putative family: Sanskrit, Greek, Latin, Germanic, Celtic, and Persian. 

Ав soon as the І-Е family was firmly established, during the second dec- 
ade of the nineteenth century, the binary approach to language classifica- 
tion was superseded by what may be called the global approach. The binary 
approach examined only two pieces of a puzzle at a time; the global ap- 
proach dictated that all pieces of the puzzle must be considered simultane- 
ously. Any arbitrary restriction of the field of investigation to, say, two 
branches came to be considered methodologically inappropriate (unless of 
course those two branches were thought to share a special relationship, 
e.g. Indic and Iranian, Baltic and Slavic, Italic and Celtic). But the binary 
approach has remained in fashion in the classification of languages from 
other parts of the world up to the present day, often with lamentable conse- 
quences, which will be examined in section 6.3. 

It would be hard to overestimate the importance of Sanskrit in the dis- 
covery and subsequent elaboration of the І-Е family. That Sanskrit dis- 
played similarities with European languages had been known, at the time 
of Jones's address, for some 200 years, ever since the Italian Filippo Sassetti, 
who lived on the west coast of India from 1581 to 1588, had reported strik- 
ing resemblances between Sanskrit and Italian. Around 1800 the Romantic 
spirit of the times brought Sanskrit into great prominence in Europe; here 
was a language clearly affiliated with European tongues, yet found in far-off 
India. The idea that this exotic tongue was related to their own languages 
excited the imagination of European society. In 1808 F. von Schiegel claimed 
that Sanskrit was in fact the parent language of Greek, Latin, Persian, and 
Germanic, and was probably related to Armenian, Slavic, and Celtic. This 
claim has since been found to be incorrect, for Sanskrit represents merely 
one branch coordinate with each of the other branches. Yet Schlegel's book 
was enormously popular, and its success served both to attract the general 
attention of the public and to stimulate research on Sanskrit by the schol- 
arly community. 

Though it is peripheral to our concerns, we should nonetheless mention 
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that the discovery of Sanskrit was important for the development of lin- 
guistic science in Europe in other ways. It turned out that besides being а 
cousin of the European languages, Sanskrit also possessed a long and dis- 
tinguished tradition in descriptive (as opposed to historical) linguistics. 
Grammars of Sanskrit were far more sophisticated than any of the gram- 
mars hitherto prepared by European scholars following the Latin model. 
The most famous Sanskrit grammar, that of Panini, has been called by the 
American linguist Leonard Bloomfield (1933: 11) “one of the greatest monu- 
ments of human intelligence. . . . No other language, to this day, has been 
so perfectly described.” Despite this illustrious descriptive tradition, how- 
ever, the Indian grammarians, like the Greeks and Romans before them, 
had been obsessed with their own language and had paid scant attention to 
other tongues. They were primarily concerned with describing and pre- 
serving their sacred language, Sanskrit, and problems of language classifi- 
cation appear to have interested them not at all. 

In spite of many valuable early observations connecting one I-E language 
(or language group) with another, a concentrated, sustained attack on the 
problem did not really get under way until the second decade of the nine- 
teenth century. The first significant scholarly work was a prize-winning es- 
say by a Dane, Rasmus Rask, completed in 1814 while the author was in 
Iceland, but not published until 1818. Through a careful examination and 
comparison of inflectional endings and basic vocabulary, and with an aware- 
ness of sound correspondences between languages that foreshadowed I-E 
research a half-century later, Rask proved beyond doubt that the Germanic, 
Latin, Greek, Slavic, and Baltic languages are all genetically related. Rask's 
monograph suffered, however, from two practical defects that undoubtedly 
diminished its impact. First, it was written in Danish, sharply curtailing its 
potential audience; and second, it ignored Sanskrit, a language that was at 
the time a fashionable topic in both popular society and learned circles. 

The second influential book on the I-E family, by the German Franz 
Bopp, appeared in 1816 and suffered from neither of the defects of Rask's 
monograph. Like Rask, Bopp concentrated on inflectional endings and un- 
covered such an abundance of similarities that the genetic relationship of 
Latin, Greek, Germanic, Persian, and Sanskrit would never again be seri- 
ously questioned. Furthermore, Bopp demonstrated that Sanskrit was not 
the source of the I-E languages, but simply another branch of I-E, on the 
same taxonomic level as other branches such as Latin, Greek, Germanic, 
and Slavic. 

It was not by chance that Rask and Bopp both chose inflectional endings 
as the focus of their research. As we have already seen, the science of ety- 
mology enjoyed a very poor reputation at this time, due primarily to its in- 
ability to distinguish probable (or even possible) phonetic and semantic 
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shifts from improbable ones. Such lack of constraints had for centuries 
allowed scholars to derive words in any language from Biblical Hebrew 
through a series of totally improbable semantic and phonetic shifts. The 
search for cognates was further confounded by an inability to distinguish 
words that had been borrowed from another language from words that had 
been inherited from earlier stages of the same language. In choosing to in- 
vestigate the inflectional apparatus of the I-E languages, Rask and Bopp 
avoided the problem of semantic similarity by examining inflectional end- 
ings whose semantic equivalency was taken for granted. Because inflec- 
tional endings are seldom borrowed, the confusing factor of loans was also 
eliminated. The only major variable was phonetic similarity, and this proved 
to be transparent enough to readily convince those who examined the evi- 
dence of the genetic unity of I-E. 

It is nevertheless largely a historical accident that the I-E family was es- 
tablished by this line of proof. In the first place the I-E inflections are them- 
selves accidents of history; many languages are structurally quite different 
from І-Е and lack such inflections altogether. But even had the І-Е lan- 
guages been of a structurally different type, it seems unlikely that the ab- 
sence of inflections would have retarded the recognition of I-E as a genetic 
unit for any significant length of time. Rather, the proof would have been 
made along different lines, most probably using fundamental vocabulary. 
The early reliance on inflections for proof of genetic affiliation served the 
same purpose as training wheels on a bicycle: one doesn't really need them 
to ride a bike (or even to learn to ride a bike), but the novice often feels 
comforted by their assistance. The early focus on inflectional resemblances 
was a similar crutch, as Holger Pedersen (1931: 245) has ably argued: 


That agreement in the inflectional system is an especially clear and striking proof of 
kinship, no one denies. But it is only an anachronism in theory, which has no sig- 
nificance in actual practice, when such an argument is still designated as the only 
valid proof. No one doubted, after the first communication about Tokharian, . . . 
that the language was Indo-European, although at that time virtually no similarities 
in inflection had been pointed out. Such similarities have since been shown, but 
even where they are almost obliterated, proof of kinship could be adduced from the 
vocabulary and from sound-laws. 


I will suggest in section 3.6 that Pedersen is even here overcautious, and 
that proof of genetic kinship may in most cases be adduced from the funda- 
mental vocabulary alone. Sound-laws, inflectional similarities, and other 
arbitrary classificatory criteria are no more decisive in determining genetic 
relationship than are lexical resemblances. 

Although inflectional similarities served admirably to put the European 
scholarly community on the right track in the quest for the origin of their 
languages, exclusive reliance on this criterion at times impeded, rather than 
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aided, language classification. Rask, for example, was fully aware of lexical 
resemblances between Celtic languages and the other languages for which 
he proposed genetic affiliation. But the Celtic languages do not display (or 
at least display less clearly) the inflectional similarities that linked the other 
groups so readily. Rask chose therefore to ignore the lexical resemblances 
as untrustworthy and excluded Celtic from his 1814 essay. Three years later 
Rask changed his mind on this point, and as soon as he became familiar 
with Sanskrit and Persian, he also added them to the family. Bopp appar- 
ently remained unconvinced of the І-Е affinity of Celtic until his 1839 
monograph. 

By 1820 it had been established, primarily through the work of Rask and 
Bopp, that the Indo-European family covered most of Europe and extended 
through Asia Minor into northern India (see List 2.2). 


List 2.2. The Indo-European Family (c. 1820) 


INDO-EUROPEAN: 
I GREEK: Greek 
II ITALIC: Latin, .. . 
HI CELTIC: Irish, Breton, .. . 
IV GERMANIC: German, English, . . . 
V SLAVIC: Russian, Polish, . . . 
VI BALTIC: Lithuanian, Latvian 
VII IRANIAN: Farsi (= Persian), . . . 
VIII INDIC: Sanskrit, Hindi, Bengali, . . . 


Besides Rask and Bopp, there was a third major figure in the founding 
and initial development of comparative I-E grammar. This was the German 
scholar Jacob Grimm, who is better known for compiling and publishing, 
with his brother Wilhelm, a collection of children's stories, Grimms' Fairy 
Tales. Within the scholarly field Grimm produced a Deutsche Grammatik 
(Germanic Grammar), of which the second edition of Volume 1 (1822) is 
more often cited than the 1819 first edition. Grimm's work was influential 
in three respects. First, it uncovered the presence of ablaut (e.g. vowel alter- 
nation between morphologically related forms, as in ‘sing, sang, sung’) not 
only in Germanic, but in the I-E parent language as well. Second, it stressed 
the importance of systematic phonetic correspondences between related 
languages. Third, it served as a model for the comparative study of a 
branch of I-E, taking into account and weighing carefully the lower-level, 
more concrete evidence of the Germanic languages with the higher-level, 
more abstract evidence of the other branches of I-E. 

Grimm was not the first to recognize systematic sound correspondences 
between related languages. Indeed it is clear that he owed much of his 
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awareness оп this point to Rasks monograph. But Grimm nonetheless рор- 
ularized the notion of sound correspondence to such ап extent that the 
most famous of all sound correspondences, though first stated explicitly by 
Rask, is today called Grimm's Law. Though Grimm may have discovered 
this correspondence in Rask's work, he neither named it after himself nor 
called it a law. The term Grimm's Law was invented by Max Müller in 
the 1860's. Since sound correspondences will play a central role in the 
subsequent history of the I-E family (and other genetic groupings as well), 
it is appropriate at this point to examine this most famous of all sound 
correspondences. 

In simplest terms, Grimm's Law specifies the precise manner in which 
Germanic consonants are related to different consonants in the other 
branches of I-E. Using Gothic as the earliest attested Germanic language, 
the correspondences subsumed under Grimm's Law may be pictured in 
slightly simplified form as shown in Table 2.1 (adapted from Lyons 1968: 
27). What the law represents, then, are the sound changes (or sound shifts, 
as Grimm would have called them) that transformed the I-E vocabulary into 
various Germanic cognates (the asterisks identify reconstructed sounds). 
Corresponding to a presumed І-Е “рей — “рой ‘foot,’ we have Latin pedis, 
Greek podos, Sanskrit padas, Gothic fotus (and English ‘foot’), thus illustrat- 
ing the first column of Table 2.1. Similar examples for the other columns 
may be found in the basic handbooks. It should be noted that the changes 
implied in Table 2.1 were not necessarily accomplished in a single step. If, 
for example, I-E "g became Latin H, it is unlikely to have done so directly, as 
the sound correspondence implies: 


(1) I-E *g > Latin h 


Far more likely, there were intermediate stages, now telescoped, separating 
the initial and final stages shown in (1). One might hypothesize a develop- 
ment such as: 


(2) L-E *g > k" (as in Greek) > x > Latin h 


If one understands the phonetics of the sounds listed in Table 2.1, it is 
apparent that Grimm's Law consists of three separate subrules, which ас- 
count for the correspondences designated A, B, and C in Table 2.1: 


Subrule A: 1-Е voiceless stops become voiceless fricatives in Germanic (p > f, 
t > 0, k > h). 


Subrule B: I-E voiced stops become voiceless stops in Germanic (b > p, d > 
t, g > k). 


Subrule C: I-E voiced aspirates become voiced stops in Germanic (b > b, d > 
d, g > g). 
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Table 2.1. Grimm's Law 


Consonant group 
Language А B C 
І-Е *p *t *k *b *d *g *b “d ж 
Пайп р t k b d g f f h 
Greek р t k b d g p" th kh 
Sanskrit Р t Š b d J b d h 
Gothic f 0 h p t k b d Р; 


Тһе ехїепї to which these three subrules аге interrelated, ог causally соп- 
nected, is still a matter of dispute. 

The 1830's saw further significant developments іп І-Е research. Іп 1833, 
the year after Rask's death, Bopp began publishing the first comparative 
grammar of LE. Publication of this work was not completed until 1852, and 
Old Slavic was included only starting with the second volume. Armenian 
was added to the second edition (1857—61), and the third and final edition 
appeared іп 1868, the year after Bopp's death. Little attuned to the sound 
shifts that so fascinated both Rask and Grimm, Bopp remained all his life a 
devotee of morphology, especially inflectional morphology. 

From 1833 to 1836 A. F. Pott published a work that both revitalized the 
long-moribund science of etymology and established comparative І-Е рһо- 
netics. In 1839 Bopp demonstrated conclusively the І-Е affinity of the Celtic 
languages, and in 1854 he recognized that Albanian constitutes a separate 
branch of I-E. Work on collateral branches of I-E was also progressing dur- 
ing this period, paralleling Grimm's work on Germanic. Among various im- 
portant contributions, one may cite Diez (1836—44), von Miklosich (1852- 
75), Zeuss (1853), and Curtius (1858-62). The first linguistics journal 
devoted to the I-E family began to appear in 1851. 

Of the second-generation Indo-Europeanists, no one was more colorful, 
or controversial, than the German August Schleicher. Originally trained as 
a botanist, Schleicher considered language a living organism, and lin- 
guistics a branch of the natural sciences. He thus rejected the traditional 
view of language as a set of social conventions passed on from generation 
to generation and properly studied within the humanities or social sci- 
ences. In a discipline that revered the quiet contemplation of ancient texts 
in libraries and private studies, Schleicher undertook what can only be de- 
scribed as anthropological fieldwork in the inhospitable Baltic region. The 
result was a splendid Handbuch der litauischen Sprache (Lithuanian Hand- 
book). The first volume of this work (1856) was a grammar of Lithuanian, 


2.3 Indo-Hittite 45 


which was followed а year later Бу а second volume containing а collection 
of stories, songs, riddles, and proverbs upon which the grammar had been 
based. 

Schleicher's investigation of Lithuanian paid off more handsomely than 
even he could have anticipated, for it turned out that modern Lithuanian is 
іп certain respects as archaic as extinct languages known from documents 
several thousand years old (e.g. Sanskrit, Latin, Classical Greek). Indo- 
Europeanists had up to this time undervalued modern languages such as 
Lithuanian, which possessed no ancient texts, feeling that since languages 
are constantly changing, the modern stages could not possibly preserve 
any traits that would not be better preserved in the early texts. This belief 
was bolstered by the high esteem in which Sanskrit was held. Schleicher's 
work on Lithuanian showed very clearly that different areas of linguistic 
structure may age at different rates, and thus a language that is conservative 
with respect to inflectional morphology may be innovative with regard to 
phonology. 

Whereas earlier scholars such as Rask, Bopp, Grimm, and Pott had been 
content with pointing out resemblances (such as cognate words) between 
different branches of I-E, Schleicher went one step further and attempted to 
use that information to reconstruct the I-E words from which the known 
cognates must have diverged. It was thus the I-E parent language itself, 
rather than its reflection in the daughter languages, that was the focus of 
Schleicher's Compendium (1861-62). In this work Schleicher introduced the 
practice of identifying reconstructed forms with an asterisk. The custom 
has been maintained ever since as a convenient way of distinguishing actu- 
ally attested earlier stages from unattested reconstructions. For example, 
Schleicher reconstructed І-Е *akva-s on the basis of the following forms for 
"horse": 


(3) Sanskrit a$va-s Avestan aspa- 
Стек — híppo-s Old English eoh 
Latin equu-s Old Irish | ech 


The attempt to reconstruct actual words led Schleicher to attempt a re- 
construction of an entire fable in I-E, much to the chagrin of his more staid 
colleagues. Of course the reconstructed fable was incorrect in many re- 
spects; today an awareness of the overwhelming difficulties involved in 
such an enterprise discourages even the attempt. Yet for all its flaws, 
Schleicher's reconstructed fable served a useful purpose in focusing atten- 
tion on the existence of the parent language itself, rather than merely dwell- 
ing on perceived similarities between daughter languages, as his predeces- 
sors were wont to do. 

It was also Schleicher who first drew a branching tree diagram for the I-E 


46 2 EUROPE 


family. Given his background ав а botanist and his interest іп evolutionary 
studies, this is not surprising. In his 1861 Compendium Schleicher proposed 
for the І-Е family the subgrouping in List 2.3. І have substituted modern 
eguivalents for names that have fallen into disuse; groups that are in fact 
single languages (e.g. Albanian) аге not capitalized in the lists in this vol- 
ume. One should note that all splits are binary, and that Armenian is still 
considered a member of the Iranian family. It was not until 1877 that the 
German scholar Heinrich Hübschmann finally sorted out the massive body 
of Iranian loanwords and demonstrated that Armenian, though a single 
language, constitutes a separate and distinct branch of I-E. 


List 2.3. The Indo-European Family (Schleicher 1861) 


INDO-EUROPEAN: 
I SLAVO-GERMANIC: 













ІП INDO-CELTIC] 


A GERMANIC B GREEK-CELTIC: 
B BALTO-SLAVIC: 1 GREEK-ALBANIAN: 
1 BALTIC a GREEK 
2 SLAVIC b Albanian 
П INDO-CELTIC: 2 ITALO-CELTIC: 
А INDO-IRANIAN: a ITALIC 


1 IRANIAN (including 
Armenian) 
2 INDIC 


b CELTIC 





It is also not surprising that the ex-botanist Schleicher should have been 
one of the first to embrace Darwinian principles, as he did in an 1863 mono- 
graph. It must have seemed to Schleicher in the 1860 that he was riding the 
wave of the future; yet subsequent history reveals that in many respects it 
was the wave of the past. Research on I-E in the nineteenth century is often 
divided into two periods. The first period began with the pioneering work 
of Rask, Bopp, and Grimm and lasted roughly through the 1860's. The sec- 
ond phase commenced in the 18705 and extended into the twentieth cen- 
tury. When Schleicher died in 1868, at the age of 47, the first phase of I-E 
research was drawing to a close. During the second period certain themes 
that had played a prominent role in the early work would either fade in im- 
portance (e.g. inflectional endings, Sanskrit) or disappear altogether (e.g. 
Schleicher's treatment of language as a living organism). Other issues, some 
of them foreshadowed in the first period, arose to take their place (e.g. pho- 
netics, sound laws, the regularity of sound change, syntax). 

It is customary in appraising the work of the early scholars to emphasize 
their genuine insights and to pass over in silence their sometimes obvious 
(from a modern vantage point) errors. I have done this consciously up to 
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this point. But before turning to the second phase of LE research, І should 
at least mention some of the inadeguacies of the early work: 

1. The early scholars were notoriously poor phoneticians, often display- 
ing a total confusion between the sounds of a language and the letters of an 
alphabet used to represent them. Grimm, for example, considered the Ger- 
man word Schrift ‘writing’ to contain eight sounds, with the letter f repre- 
senting the two sounds ph! In fact, the word contains but five sounds: 
[Srift]. 

2. Despite repeated warnings, Sanskrit was from the start accorded too 
much importance in determining the structure of I-E. Its archaic inflec- 
tional morphology, considered the key area in early work on genetic classi- 
fication, had led many scholars to assume (incorrectly) that Sanskrit was 
comparably old in other areas of linguistic structure as well. Schleicher's 
work on Lithuanian only tarnished the image of Sanskrit as being most 
faithful to I-E structure. Not until the second period of research did it be- 
come clear just how distant Sanskrit was, in some respects, from the origi- 
nal I-E language. 

3. Schleicher's work, in particular, suffered from a rather slavish ad- 
herence to a priori notions of symmetry, beauty, and elegance, which he 
presumed were attributes of the "perfect" proto-language. We have already 
seen that his I-E subgrouping (List 2.3) involved only binary branchings. 
Schleicher's proto-language was itself based on threes: three vowels (1, a, и), 
three places of articulation (p, t, k), and three manners of articulation (t, d, 
dh(= d)). Later scholars were more willing to let the evidence speak for it- 
self, without imposing upon it some preconceived ideal pattern. 

4. From the modern vantage point the most embarrassing of the defects 
of the early period were the related beliefs that (a) languages, like plants 
and animals, pass through periods of growth, maturity and decay; and (b) 
the relative "stage" of a language could be determined by its structural 
type. Schleicher was a major advocate of both views, as was the German 
diplomat Wilhelm von Humboldt. Schleicher believed that throughout pre- 
history (that is, roughly up to the time when I-E was spoken) human lan- 
guages, like species, had been gradually progressing in complexity and so- 
phistication. I-E supposedly represented the "mature" stage of language 
development, followed in the last several thousand years by various de- 
grees of decay and decline in different branches. Languages other than I-E 
were presumed to have advanced at different rates, some having remained 
in a primitive state (e.g. Chinese), while others had progressed to a stage 
intermediate between primitive Chinese and advanced І-Е (e.g. Turkish, 
Hungarian). Whether a language was advanced or primitive depended on 
its structural type, according to which isolating languages like Chinese were 
taken to be the primitive stage of language development, agglutinating lan- 
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guages like Turkish and Hungarian were seen as an intermediate stage, and 
inflectional languages such as І-Е represented the highest stage. 

Both of these theories are today discounted. Without exception, lin- 
guists agree that there are no “primitive” languages extant. Nor are there 
any linguistic “missing links” connecting full-fledged human speech with a 
more primitive communication system. In some ill-defined sense linguists 
consider all human languages currently spoken to be “equal.” Of course it 
is possible to say that French and English have complex vowel systems, that 
Caucasian languages have complex consonant systems, and that Oceanic 
languages often have intricate pronominal systems. It is not possible at 
present to assign a weight to each aspect of linguistic structure in such a 
way that the total complexity of a language could be measured. Nor is it 
clear exactly what such a measure would mean even if it were in fact pos- 
sible to arrive at one. Certainly English, and European languages in gen- 
eral, do not appear to function less effectively than Oceanic languages, 
despite having, by Oceanic standards, a very rudimentary pronominal sys- 
tem. Nor do languages that have many consonants (e.g. Georgian, Nama) 
appear superior in any recognizable way to languages with relatively few 
consonants (e.g. Hawaiian, Sentani). In fact, it is not even clear whether 
having a large consonant inventory is advantageous, detrimental, or irrele- 
vant to the efficient functioning of language. 

To a large extent the preoccupation with the organic growth of language 
was a residue of the earlier concern with the origin of human language. 
During the second period of I-E research, it was realized that human lan- 
guages spoken, say, 5,000 years ago (e.g. I-E) did not differ in any funda- 
mental way from contemporary languages. Meillet (1922b: 469) described 
this fundamental change in outlook after 1875 as follows: "Indo-European 
comparative grammar was no longer concerned with a would-be organic 
period, a period of development about which nothing was known; rather it 
simply continued into a slightly more remote past the research of Romance, 
Germanic, Celtic, Slavic, and Iranian scholars, obtaining the same kind of 
results by means of the same methods." 

The supposed hierarchy from primitive isolating languages through ag- 
glutinating to advanced inflectional languages was entirely the product of a 
pervasive ethnocentrism that colored much of nineteenth-century thought. 
It was then fervently believed by many (as it still is by a few) that European 
languages, customs, religions, and "races" were superior to their counter- 
parts in other regions of the world. Inflectional languages were taken to be 
the highest form of language by definition rather than by demonstration. In 
fact, it turned out that the modern Chinese languages, which were usually 
cited as the chief example of the pure isolating type, derive from an Old 
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Chinese that was apparently inflectional. Тһе contemporary view is that 
this taxonomy is of limited value because many languages do not fall neatly 
into just one category. Moreover, there is no evidence that languages of any 
structural type are more “advanced” than languages of some other struc- 
tural type. 

It is, however, not the poor phonetics, overvaluation of Sanskrit, belief in 
the organic growth of language, or alleged superiority of some languages 
over others that primarily distinguishes linguists of the first period from 
those of the second. Rather, the chief difference lies in their attitude toward 
sound correspondences or, to be more precise, in their attitude toward the 
regularity of such sound shifts. The early scholars never envisaged sound 
correspondences as being in any sense absolute; there were simply too 
many exceptions to even the most famous sound shifts for them to be 
swept under the rug and ignored. For Grimm (as quoted in Robins 1967: 
172), "the sound shift is a general tendency; it is not followed in every 
case." However, a series of revolutionary discoveries, made primarily dur- 
ing the 18705, led many linguists to the conclusion that sound correspon- 
dences exhibit far greater regularity than had previously been recognized. 
A group of young scholars based in Leipzig went so far as to proclaim that 
"sound laws admit of no exceptions" (August Leskien, quoted in Hockett 
1965: 187). Karl Verner's more moderate version of the "regularity hypothe- 
sis" is perhaps a better description of the new spirit: "There must be a rule 
for exceptions to a rule; the only question is to discover it" (quoted in 
Robins 1967: 184). 

This new doctrine of regular sound change triggered an acrimonious de- 
bate between the so-called Young Grammarians (including Karl Brugmann, 
Hermann Osthoff, August Leskien, Vilhelm Thomsen), who championed 
the regularity of sound change, and the Old Guard (especially G. Curtius, 
but also such able younger scholars as Johannes Schmidt, Hermann Collitz, 
and Hugo Schuchardt), who refused to accept the doctrine of absolute 
regularity. This bitter dispute was not resolved by the conversion of one 
side by the other; the hostility involved probably precluded that. In fact, a 
division between proponents and opponents of regular sound change has 
persisted to the present day in attenuated form. Ordinarily a fundamental 
division of any science into two camps must eventually work itself out, for 
otherwise there will develop two versions of the same science, an intoler- 
able (or at least undesirable) state of affairs. There was, however, a curious 
aspect of this particular dispute that made its resolution unnecessary: both 
sides worked as if sound change were regular, and both groups produced 
significant research; both sides recognized that scholars of the early period 
had overlooked a great deal of subtle regularity in the I-E evidence. The 


50 2 EUROPE 


discovery of explanations for seemingly recalcitrant exceptions led the 
Young Grammarians to the belief that all such exceptions were potentially 
explainable, whereas their opponents were willing to grant only that regu- 
larity in the I-E family existed on a level hitherto unnoticed. The acceptance 
of the "regularity hypothesis" in the 1870's (either as an article of faith by 
the Young Grammarians or simply as a working hypothesis by their oppo- 
nents) is considered by Hockett (1965: 188) to be the second great break- 
through in the field of linguistics. 

As an illustration of the work done by the Young Grammarians, let us 
examine one of the most celebrated discoveries of the 1870's. We have al- 
ready seen earlier in this chapter (Table 2.1) how Grimm's Law describes the 
relationship of Germanic consonants to those of other branches of I-E. Ac- 
cording to Grimm's Law, we should expect Latin і and Sanskrit t to corre- 
spond to Germanic Ө, as it does in the word ‘brother’: 


(4) Sanskrit bhrátar- 
Latin frater 
Gothic — bró8ar 


It was recognized from the outset, however, that there are a host of excep- 
tions to Grimm's Law in which b, d, and g are found in place of "expected" 
f, 9, and h, respectively. (The precise phonetic character of b, d, and ç in 
early Germanic is in dispute; voiced fricatives (8,0, y) are a likely possibil- 
ity.) One such example is the word ‘father’: 


(5) Sanskrit pitár- 
Latin pater 
Gothic fadar 


The rather numerous exceptions of this nature did not trouble the early 
linguists like Rask and Grimm, who did not expect sound change to be 
regular. But they did trouble the Young Grammarians, who demanded ex- 
planations for apparent exceptions. In this case the explanation was sup- 
plied by Karl Verner (1877), in the same article in which he had boldly pro- 
claimed that "there must be a rule for exceptions to a rule; the only question 
is to discover it." The explanation Verner offered had to do with the posi- 
tion of the accent in Sanskrit and Greek (and hence also presumably in the 
I-E parent language). Verner noticed that if the Sanskrit and Greek accents 
precede the t, then development was according to Grimm's Law (see (4) 
above), but if the accents did not precede the t, the result was d (see (5) 
above). Verner's Law, as it came to be known, accounted for the exceptional 
appearance of certain b’s and g's as well; it may be thought of as an amend- 
ment to Grimm's Law stipulating that p,t,k have the exceptional outcome 
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b,d,g in just those cases where the I-E accent (as revealed by Sanskrit and 
Greek) did not precede the consonant in question. Pedersen (1931: 282—83) 
stressed the importance of Verner's discovery: 


The effect of Verner's article. . . wasimmeasurable. Now there were no more excep- 
tions to the Germanic sound-shift, and this absence of exceptions necessarily had 
quite as strong an effect upon the whole conception of linguistics as the chief laws 
applying to the sound-shifts had exerted in their time. Then, scholars were begin- 
ning to understand that there were laws of phonology; now, they were awaking to 
the fact that such laws operate regularly. . . . Verner's discovery suggested how ex- 
haustively one must study the phenomena and how accurately one must discrimi- 
nate, to understand the development of sounds. It was, in fact an emphatic warning 
not to regard any such development as a mere whim of language. 


Notwithstanding the instantly recognized brilliance of Verner's insight, it 
would not have had the impact it did, had it been an isolated instance. But 
it was not. Rather, Verner's Law was embedded in a mosaic of other discov- 
eries that pushed linguists to the conclusion that sound change is regular. 
Limitations of space do not permit an exhaustive treatment of the many dis- 
coveries of this fascinating era, but a few more examples seem mandatory 
in dealing with an era so rich in discovery. 

In 1878, the year after the appearance of Verner's article, the Law of Pala- 
tals was published. Originally perceived by Vilhelm Thomsen (but appar- 
ently independently discovered by Verner, Esaias Tegnér, Ferdinand de 
Saussure, Hermann Collitz, and Johannes Schmidt), this law cleared up a 
number of outstanding problems and also laid to rest the notion that San- 
skrit was archaic in all respects. Consider the following words for “and': 


(6) Sanskrit ca 
Latin que 
Greek te 


Schleicher (1861—62) posited a three-vowel system for I-E, parallel to San- 
skrit’s three short vowels: 


(7) i u 
a 


But if I-E had these three vowels and passed them down intact to 
Sanskrit, two problems arise. First, why were some Sanskrit K's palatalized 
to č (the sound in ‘church’), while other Кз were unaffected, before the 
vowel a (cf. Sanskrit ба ‘and’ with Sanskrit kas ‘who’)? Second, why did I-E 
“a change into e in certain words (the same words) belonging to the Euro- 
pean I-E languages? These anomalies were eliminated by the Law of Pala- 
tals, which showed once again how I-E scholarship had been led astray by 
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the overvaluation of Sanskrit. As part of the Law of Palatals it was recog- 
nized that, with respect to the vowel system, it was the European languages 
that preserved the original I-E system: 


(8) i u 


Sanskrit had undergone two sound changes (e > a and o > a), thus ar- 
riving at the (short) vowel system shown in (7), but these mergers took 
place only after the vowel e had palatalized preceding К. Thus Sanskrit ča 
may be traced back to I-E as follows: 


(9) LE “ke > “te > Sanskrit ба 


Тһе Law of Palatals demonstrated clearly the danger of overvaluing опе lan- 
guage at the expense of others, and it also showed how the motivating fac- 
tor in a sound change may itself subsequently disappear through the action 
of other sound changes. 

One final work, by the Swiss scholar Ferdinand de Saussure, should be 
mentioned. In 1878, at the age of 21, Saussure published a brilliant analysis 
of the І-Е vowel system using the methods of the Young Grammarians. In 
this monograph Saussure suggested (among other things) that the І-Е par- 
ent language must have had certain vowels that subsequently disappeared 
from all attested І-Е languages. Using what is today called the method of 
internal reconstruction, Saussure argued that the I-E parent language must 
have had two “sonorant coefficients,” which disappeared after either 
lengthening or changing the timbre of a preceding vowel. Reviewers of 
Saussure's Mémoire suggested that these “sonorant coefficients" were most 
probably consonants rather than vowels. Saussure's bold conjecture was 
never verified during his lifetime, but the discovery of Hittite in the early 
twentieth century revealed an I-E language that had maintained "sonorant 
coefficients" (or "laryngeal consonants," as they came to be called) in at 
least some of the positions that Saussure had predicted. The Polish scholar 
Jerzy Kurylowicz first pointed this out in 1927, 14 years after Saussure's 
death. 

The discovery of the regularity of sound change had several salutary 
effects on the field of linguistics. On the one hand, it both encouraged, and 
was encouraged by, the rapidly developing field of phonetics. At the same 
time, by focusing attention on the residue of exceptions to established 
sound laws, linguists became more aware of other factors affecting the evo- 
lution of language. Among those factors responsible for exceptions to 
sound laws, the following were quickly recognized: 
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1. Loanwords, long the bugaboo of I-E research, were often found to be 
exceptions to general sound laws. Heinrich Hübschmann, beginning іп 
1875, used this new awareness to distinguish the innumerable Iranian loans 
from the true Armenian lexical heritage and to establish the basic sound 
]aws of Armenian. Only when the Iranian loans were filtered out of the vo- 
cabulary was Hübschmann able to demonstrate that Armenian is not a 
member of the Iranian branch of I—E, as previously believed, but rather a 
separate and independent branch. 

2. Тһе disturbing influence of writing or script on regular phonetic devel- 
opment ajso became apparent. 

3. А whole host of sporadic phonetic processes (e.g. assimilation/dis- 
similation at a distance, metathesis, haplology, contamination) were seen 
typically to affect single words rather than the entire vocabulary. 

4. Of all the forces counteracting regular sound change, analogy turned 
out to be one of the most pervasive. Although the influence of analogy on 
language had long been recognized, it is arguable that analogy can only be 
truly appreciated against a background of regular sound change. As long 
as arbitrary sporadic sound changes were tolerated, as they were in the 
early period of linguistic research, the true nature of analogy remained ob- 
scure. To illustrate the role of analogy in linguistic evolution consider the 
following forms: 


(10) Greek phéro Tbear ei-mi ‘Igo’ dído-mi “give 
Sanskrit ^ bhára-mi 'Ibear' | e-mi ‘Igo’ dada-mi | give’ 


In Sanskrit the first-person marker -ті (cognate with English ‘me’) is affixed 
regularly to all verbs; in Greek, however, there is a group of verbs to which 
the suffix -mi is not added (e.g. phérd). In the early period, when Sanskrit 
was considered the most faithful descendant of I-E and I-E was itself con- 
sidered the paragon of human language, the completely regular Sanskrit 
pattern shown in (10) was assumed to have been inherited directly from 
I-E. The problem of when and why Greek had lost this ending in certain 
verbs was unresolved. In 1868 Wilhelm Scherer, a precursor of the Young 
Grammarians, argued that it was in fact Greek that preserved the I-E state 
and Sanskrit that had innovated. He pointed out that all of the European 
branches of I-E, as well as the older Iranian documents, preserved verbs 
that did not take the -ті suffix, at least in part. In Germanic and Slavic the 
class of -mi verbs had also grown at the expense of the suffixless verbs, 
though not so completely as in Sanskrit. From the 1870's onward analogy 
assumed greater importance as its role in leveling out the idiosyncrasies of 
regular sound change was perceived more clearly. 

The demise of the organic theory of linguistic growth and decay, the de- 
valuation of Sanskrit, the growing awareness that linguistic change was far 
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more tightly constrained than had been thought, and a better understand- 
ing of phonetics all necessitated a thorough reevaluation of the results 
of the early period. The older handbooks of Bopp and Schleicher were 
rendered obsolete and had to be replaced, rather than revised, Бу а work 
written from the new perspective. To fill this void Karl Brugmann pub- 
lished, between 1897 and 1916, a two-volume comparative grammar of I-E 
that remains today "the latest full-scale treatment of the family" (Cowgill 
1974: 434). 

During the second period of research, Schleicher's subgrouping of I-E 
(List 2.3 above) underwent several substantial modifications: (1) Armenian 
was demonstrated to be an independent branch of I-E, not a member of the 
Iranian branch; (2) higher-level groupings between branches were elimi- 
nated, with the exception of Indo-Iranian, Balto-Slavic, and Italo-Celtic; 
and (3) in 1880 F. von Bradke divided the I-E languages into two groups, a 
Western group characterized by the retention of initial k- in the word for 
‘hundred’ (e.g. Latin centum) and an Eastern group in which this k- has 
been assibilated to s-, 3-, or si- (e.g. Avestan satem). At the close of the nine- 
teenth century, the subgrouping of the I-E family was conceived to be some- 
thing like that in List 2.4. In this scheme the geographic position of Alba- 
nian seemed anomalous, since it is an Eastern (i.e. satem) language found 
in what is basically Western territory (if one discounts the recent Slavic ex- 
pansion). Prehistorical migration from a more Eastern position was the 
usual explanation, though no supporting evidence was adduced. 


List 2.4. The Indo-European Family (c. 1900) 









INDO-EUROPEAN: 


I EASTERN (= satem): II WESTERN: (7 centum) 





А INDO-IRANIAN: A GERMANIC 
1 INDIC B ITALO-CELTIC: 
2 IRANIAN 1 ITALIC 
B Armenian 2 CELTIC 
C Albanian C GREEK 
D BALTO-SLAVIC: 
1 BALTIC 
2 SLAVIC 


During the first decade of the twentieth century, unexpected archaeolog- 
ical discoveries in Asia revealed two additional branches of I-E, Hittite and 
Tocharian, which are both centum languages in an area (Asia) that was sup- 
posed to know only satem languages. Тһе Eastern-Western (= centum- 
satem) dichotomy of I-E languages dissolved with the discovery of these two 
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extinct centum languages, illustrating the danger of basing genetic classifi- 
cation on a single phonological trait. 

Around the turn of the century, archaeologists discovered documents 
in Chinese Turkestan in a hitherto unknown language. Written in a North 
Indian script (some were even bilingual), and dating from around the sev- 
enth century А.р., these texts revealed two similar languages, now called 
Tocharian А and B, whose I-E affinity was obvious. The proof of affiliation 
was published in 1908 by the Germans Emil Sieg and Wilhelm Siegling. 
Also obvious was the fact that Tocharian constitutes a separate branch of 
I-E. It was probably extinct by the beginning of the second millennium A.D. 

In 1887 a cache of clay tablets, ancient correspondence between Egyp- 
tian kings and Middle East potentates, was unearthed in Egypt. All were 
written in cuneiform, and all but three of the several hundred tablets were 
in Akkadian, a Semitic language used at that time as an international diplo- 
matic language in the region. One letter was in Hurrian, an extinct lan- 
guage of unknown affiliation spoken around 1400 в.с. in what is today 
southern Turkey. The final two letters were in an unknown language. In 
1902 the Norwegian scholar Jorgen Knudtzon published these two letters 
with a translation and critical commentary. He argued that the letters, ad- 
dressed to the king of Arzawa (thought by scholars to have been located 
somewhere in southeast Asia Minor), were in an I-E tongue, possibly the 
language of the Hittites mentioned in the Bible. This proposal was greeted 
with skepticism; unlike Tocharian, the language was not transparently I-E, 
and the evidence was admittedly slim. 

Confirmation of Knudtzon’s conjecture was not long in coming, though it 
arrived by a rather circuitous route. In 1893 a French archaeological expedi- 
tion to what is today central Turkey had discovered 14 tablets in an un- 
known language. The contents of these tablets had been published in Paris 
in 1898. An English scholar, A. H. Sayce, pointed out that the language of 
these tablets was the same as that of the Arzawa letters. Sayce’s observation 
led the German Hugo Winckler to undertake an expedition to the site 
(Bogazkóy, 95 miles [150 km] east of Ankara, Turkey); from 1905 to 1907 
thousands of additional tablets were discovered in what turned out to be 
Hattusa, the capital of the ancient Hittite empire. Dating from roughly 
1550-1200 в.с., these tablets were older than any previously known I-E in- 
scriptions. Decipherment progressed rapidly, and in 1915 the Czech scholar 
Friedrich Hrozny proposed the I-E affinity of Hittite; two years later he 
published the first Hittite grammar. Despite an abundance of tablets (whose 
number would eventually grow to 25,000) skepticism about the I-E nature 
of Hittite continued into the early 1920's. Hittite largely lacked the complex 
inflectional apparatus scholars had come to expect of an I-E Janguage (espe- 
cially an early one), and its vocabulary contained a sizable non-I-E element. 
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Nevertheless, as more and more texts became available in the 1920's, the I-E 
affinity of Hittite gained general acceptance. 

In 1926 the American linguist Edgar Sturtevant proposed that the archaic 
character of Hittite and its remoteness from other branches of I-E were due 
to the fact that Hittite is a sister language of I-E, rather than a daughter lan- 
guage, as had been previously assumed. Thus for Sturtevant and his sup- 
porters, I-E and Hittite (more precisely, the Anatolian group, of which Hit- 
tite is one member) are co-equal branches of Indo-Hittite. Bloomfield (1933: 
64, 312) presents the subgrouping for Indo-Hittite given in List 2.5. Notice 
that in addition to the exclusion of Anatolian from I-E proper, the Italo- 
Celtic group has been discounted. Italic and Celtic had been joined to- 
gether largely (but not exclusively) on the basis of an r- passive. With the 
discovery of the same r- passive іп Tocharian and Hittite, it was realized 
that this particular trait did not indicate any special relationship between 
Italic and Celtic (because it was not shared by them exclusively). 

Sturtevant's Indo-Hittite proposal has met with a decidedly mixed reac- 
tion. Although most linguists today reject it outright (e.g. Jerzy Kurylowicz; 
Jaan Puhvel 1966), a few accept it (Bloomfield 1933: 64; Hockett 1958: 591; 
Cowgill 1975, 1979; Greenberg 1987). The Balto-Slavic unity also remains in 
dispute, but is probably accepted by the majority of scholars. Voegelin and 
Voegelin (1977) present a subgrouping that discounts both the Balto-Slavic 
group and Sturtevant's Indo-Hittite hypothesis (see List 2.6). 


List 2.5. The Indo-Hittite Family List 2.6. The Indo-European Family 


2 SLAVIC 


(Bloomfield 1933) (Voegelin & Voegelin 1977) 
INDO-HITTITE: INDO-EUROPEAN: 
I tANATOLIAN: Hittite, I tANATOLIAN 

Luwian, Palaic, Lydian П Armenian 
П INDO-EUROPEAN: III TTOCHARIAN 
А Armenian IV INDO-IRANIAN: 
B tTOCHARIAN A INDIC 
C INDO-IRANIAN: В NURISTANI 
1 INDIC C IRANIAN 
2 IRANIAN V Albanian 
D Albanian VI GREEK 
E GREEK VII ITALIC 
F ITALIC VIII CELTIC 
G CELTIC IX GERMANIC 
H GERMANIC X BALTIC 
I BALTO-SLAVIC: XI SLAVIC 
1 BALTIC 
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2.8.2 Present Status of Indo-Hittite Classification 


Within the Indo-Hittite family there is scholarly consensus that the fol- 
lowing subfamilies constitute genetic nodes, as defined in section 1.1: 


(11) ANATOLIAN ITALIC 
Armenian ROMANCE 
TOCHARIAN CELTIC 
INDO-IRANIAN GERMANIC 
Albanian BALTIC 
GREEK SLAVIC 


Indo-Hittite (or Indo-European) is itself, of course, a valid genetic group. 
With regard to Indo-Hittite subgrouping, one continuing dispute involves 
the proper position of Anatolian, which is for most scholars simply a coor- 
dinate branch of I-E оп a par with Italic, Celtic, Germanic, Albanian, etc. 
Other scholars, as we have seen, regard Anatolian as coordinate with a 
group containing all other Indo-Hittite languages (i.e. the traditional I-E 
family, including Tocharian). 

The scholarly community remains quite divided on the validity of Sturte- 
vant's Indo-Hittite hypothesis, with supporters a decided minority. Three 
factors have apparently inhibited an objective appraisal of Sturtevant's the- 
sis. (1) Some scholars have undoubtedly been reluctant to replace the tradi- 
tional (and venerated) label Indo-European with the newfangled term 
Indo-Hittite. (2) The fact that Anatolian is attested solely by inscriptions 
from a few languages that have been extinct for millennia has not helped its 
case for being taken as a genetic group coordinate with the far-flung I-E 
family. Whether or not a language (or group of languages) is extinct, or has 
relatively few speakers, has no bearing on its proper position in a genetic 
classification. Notwithstanding, languages with many speakers have al- 
ways received special treatment in the linguistic literature, and sometimes 
unwarranted prominence in genetic classifications. (3) The acceptance of 
Anatolian as a sister language of I-E, rather than simply one of many 
daughters, reduces the traditional I-E family to a taxonomically less signifi- 
cant position. Although it may seem surprising that sober scholars could let 
their judgments be swayed by the mere position of a group of languages in 
a genetic classification, such has in fact been the case on several occasions. 
Some Bantuists, for example, were less than enthusiastic about Greenberg’s 
“demotion” of the huge Bantu family to a modest taxonomic position 
within the Niger-Congo family. 

As we saw in section 2.3.1, there were in the nineteenth century several 
proposals for combining various of the groups listed in (11) into higher- 
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level groupings. Today, however, none enjoys much favor, with the excep- 
tion of Balto-Slavic, which, though still controversial, is quite probably a 
valid genetic unit. The once popular Italo-Celtic, on the other hand, has 
lost support, though Cowgill (1970) suggests that Italic and Celtic were 
neighboring dialects of late Proto-I-E that shared perhaps as few as five 
common innovations before going their separate ways. Indo-Hittite is thus 
currently considered to consist of two groups, Anatolian and I-E, with I-E 
itself comprising nine coordinate branches (see List 2.7). For those who re- 
ject the Indo-Hittite hypothesis, I-E would consist of 10 or 11 first-order 
groups, depending on whether or not Baltic and Slavic are joined in a Balto- 
Slavic family. 


List 2.7. The Indo-Hittite Family (1986) 


INDO-HITTITE: 
I TANATOLIAN 












E GREEK 


II INDO-EUROPEAN: F ITALIC 
А Armenian G CELTIC 
B TTOCHARIAN H GERMANIC 


C INDO-IRANIAN 
D Albanian 


I BALTO-SLAVIC 


Let us turn now to the internal subgrouping of the families listed in (11) 
above. For the small Anatolian group the internal structure is not well 
understood. Although it is clear that Luwian (both hieroglyphic and cunei- 
form) and Lycian form one unit, the relationship of this group to Hittite, 
Palaic, and Lydian is unclear. Questions of subgrouping do not arise for 
Armenian and Albanian (single languages) or Tocharian (two languages). 

Indo-Iranian consists of three well-defined subfamilies: Indic, Nuristani, 
and Iranian. Within Indic the Sinhalese-Maldivian group and Romany (the 
language of the gypsies) stand apart from the rest of the Indic languages, 
between many of which there are transitional dialects making further sub- 
grouping difficult, if not impossible. (Dardic, however, seems to be a valid 
genetic group at some level.) Nuristani represents a small group of lan- 
guages intermediate between Indic and Iranian. Within Iranian there is a 
fairly clear-cut division between Eastern and Western branches, but little 
consensus on further subgrouping. 

In antiquity Greek constituted a single fairly homogeneous language, 
with a few aberrant dialects. Modern Greek is a continuation of a koine 
based on the Attic dialect of classical times, which succeeded in obliterating 
all other dialects (e.g. Arcadian, Aeolic, Locrian, Ionic, Mycenaean) except 
for Tsakonian, which continues the Laconian dialect of ancient Sparta. 
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Tsakonian is apparently moribund, retaining no more than 10,000 speakers 
on the east coast of the Peloponnesus. 

The internal subgrouping of the Italic group is not well understood; 
some of the languages are not even clearly Italic. It is clear that of the origi- 
nal Italic group (whatever its constituents may have been) only a single lan- 
guage, Latin, survived. Latin has survived, of course, in the form of the 
modern Romance languages. Within Romance there is a clear distinction be- 
tween Eastern (Rumanian) and Western (Italian, French, Catalan, Spanish, 
Portuguese) groups. The chief classificatory problems involve the position 
of the heterogeneous Sardinian dialects and the extinct (since 1898) Dalma- 
tian language, formerly spoken on the northwest coast of Yugoslavia. Dal- 
matian has been classified as both Eastern and Western by various scholars 
(and sometimes even by the same scholar at different stages of his career). 
It shares similarities with both Rumanian and Italian, and linguists do not 
yet agree on which of the two it is most closely related to. The position of 
Sardinian is even more problematical, since it does not obviously belong to 
either Eastern or Western Romance; Hall (1974) considers it coordinate with 
the rest of Romance. Leaving aside Dalmatian, Eastern Romance is itself 
clearly divided into Northern and Southern groups, with Daco-Rumanian 
(7 Standard Rumanian) and Istro-Rumanian constituting the Northern 
group and Arumanian and Megleno-Rumanian the Southern group. With- 
in most of Western Romance there are (or at least were until fairly recently) 
transitional dialects between contiguous languages (e.g. Italian-French- 
Provencal-Gascon-Catalan-Spanish-Galician-Portuguese) that make further 
subgrouping somewhat arbitrary. Catalan, for instance, has, been included 
in both the Ibero-Romance and Gallo-Romance groups by different scholars. 

Celtic languages were once spoken throughout Europe (including 
present-day Spain, France, and Italy) and into Asia Minor (Turkey), but all 
of these Continental and Asian languages became extinct early on, with 
little known about them save their existence. These extinct Celtic languages 
are usually placed in a Continental subgroup to distinguish them from the 
four surviving Celtic languages, which belong to an Insular group origi- 
nally restricted to the British Isles. The Insular group itself has two clear 
subgroups: the Goidelic group, consisting of Irish (= Gaelic), Scottish 
Gaelic, and the extinct Manx (spoken on the Isle of Man into this century), 
and the Brythonic group, consisting of Breton, Welsh, and the extinct Cor- 
nish. Though currently spoken on the northwest coast of France (in Brit- 
tany), Breton is not the survivor of an earlier Continental Celtic tongue, but 
rather represents a reintroduction of Celtic speech from the British Isles, 
believed to have taken place around A.D. 500. Cornish was formerly spoken 
in Cornwall, England, but was gradually eroded by English, becoming ex- 
tinct during the eighteenth century. 
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Germanic consists of three subgroups: East, North, and West. АП the lan- 
guages of the East group (e.g. Gothic, Vandalic, Burgundian) are now ex- 
tinct. The North group is made up of the Scandinavian languages (Danish, 
Swedish, and Norwegian; each is to a large extent mutually intelligible with 
the other two), Icelandic, and Faeroese, the latter spoken on the Faeroe Is- 
lands north of Great Britain. West Germanic languages include German, 
Luxembourgeois, Dutch, English, and Frisian. Luxembourgeois is one of 
three official languages in Luxembourg; it is reported to be as distinct from 
Standard German as Dutch. Spoken in Holland and Germany, Frisian con- 
sists of three sharply divergent dialects (East, West, and North), no doubt 
distinct languages if the criterion of mutual intelligibility were rigorously 
applied. Afrikaans and Yiddish are offshoots of Dutch and German, re- 
spectively, and Flemish is a dialect of Dutch. Certain scholars (E. Schwarz 
and W. Lehmann) believe that the North and East groups form a higher- 
level grouping, so that the original schism in Germanic was into Western 
and Eastern groups, with the latter subsequently dividing into North- 
ern and Eastern subgroups. 

The Baltic group consists of two subgroups. Old Prussian, extinct since 
the seventeenth century, made up the West Baltic subgroup. East Baltic 
consists of the two extant Baltic languages, Latvian and Lithuanian, and 
several extinct tongues. 

Like Germanic, the Slavic family has three distinct branches. Russian, 
Byelorussian, and Ukrainian belong to East Slavic. West Slavic comprises 
Polish, Czech, Slovak, and several minor tongues. Czech and Slovak are 
mutually intelligible. South Slavic comprises four languages: Slovene, 
Serbo-Croatian, Macedonian, and Bulgarian. Slovene consists of a number 
of sharply differentiated dialects scattered amid the mountains of northern 
Yugoslavia. The other three are fairly homogeneous. Some scholars believe 
the primary division was into East and West Slavic, with South Slavic being 
a subsequent offshoot of West Slavic. 
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2.4 URALIC-YUKAGHIR 


The Uralic-Yukaghir family, comprising two dozen languages and spoken 
by some 22 million people, is located for the most part in northern Eur- 
asia, extending from Scandinavia across the Ural mountains into northern 
Asia, as shown in Map 2.2. The most divergent language of the group 
is Yukaghir, which has several hundred speakers in Northeast Asia. The 
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Uralic family itself has two well-defined branches. Тһе Samoyed group, situ- 
ated on both sides of the Ural mountains іп northern Eurasia, has four ех- 
tant languages and perhaps 30,000 speakers. The second branch, the Finno- 
Ugric family, with 19 languages and about 22 million speakers, also con- 
tains two branches. The Ugric family is composed of two languages spoken 
just east of the Ural mountains in western Asia (Kanty, Mansi) and Hun- 
garian, a Uralic island in central Europe surrounded on all sides by Indo- 
European languages. Hungarian's 14 million speakers account for more 
than half of all Uralic speakers. The second branch of Finno-Ugric is the 
Finnic family, most of whose languages are found in the Baltic region (Fin- 
land, Estonia, northern Scandinavia), though some Finnic languages are 
found on the western side of the Ural mountains. The best known Finnic 
languages аге Finnish (with five million speakers), Estonian (with а million), 
and various Saami (= Lapp) languages (with around 30,000 speakers) 
spoken in northern Scandinavia and on the Kola Peninsula of the U.S.S.R. 


2.4.1 History of Uralic-Yukaghir Classification 


The first mention of a Uralic people in the historical record occurs in Taci- 
tus's Germania (A.D. 98), where the Finns are described as inhabiting a re- 
gion northeast of the Germanic territory. The earliest recorded observation 
of similarities between two Uralic languages is usually attributed to the 
Norwegian landowner Othere, who noted resemblances between Lapp and 
Karelian around A.p. 890. The Samoyed peoples are first mentioned in 
Nestor's Chronicle (c. 1113). The western, or European, Samoyeds came 
under Russian rule during the twelfth century; the Asian, or trans-Uralic, 
Samoyeds remained outside the Russian sphere until the fourteenth cen- 
tury. Recognition that the Samoyed languages are related to the Finnic lan- 
guages did not come until the early eighteenth century. 

At the end of the fifteenth century, European scholars noticed the resem- 
blance between Hungaria and Yugria (a group of settlements just east of the 
Ural Mountains). A Ugric connection was assumed at this time, but no lin- 
guistic evidence was adduced. 

In 1671 a Swedish scholar, Georg Stiernhielm, remarked that Estonian 
and Lapp resemble Finnish in numerous ways and that Finnish, in turn, 
shares many words with Hungarian. At about the same time a German 
scholar, Martin Vogel, compared Finnish, Lapp, and Hungarian in an at- 
tempt to prove a relationship among them, but his work was never pub- 
lished. These two scholars were apparently the first to perceive the outlines 
of what has since come to be known as the Finno-Ugric family. 

The year 1717 saw two publications of importance for Uralic classifica- 
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Поп. First, а Swedish professor, Olaf Rudbeck, proposed about 100 ety- 
mologies linking Hungarian with Finnish. According to Collinder (1965: 
34), some 40 of these etymologies are still considered valid. The second im- 
portant publication was G. W. Leibniz's Collectanea Etymologica, in which the 
German scholar J. G. von Eckhart proposed a relationship among Finnish, 
Estonian, Livonian, Hungarian, Xanty, and Samoyed, thus for the first time 
linking the Samoyed languages with the Finno-Ugric group previously rec- 
ognized by Stiernhielm. This larger grouping is today known as the Uralic 
family. 

In 1730 the Swedish army officer Phillip Johann von Strahlenberg pub- 
lished a work in which he correctly delineated, for the first time, all the 
branches of the Finno-Ugric family except Lapp. In 1770 a German histo- 
rian, August Ludwig von Schlózer, added Lapp to Strahlenberg’s classifica- 
tion. At this point all the constituent branches of Finno-Ugric were known, 
as was the connection between the Finno-Ugric and Samoyed languages. 
The basic structure of the Uralic family had thus been roughly worked out 
at least six years before William Jones's celebrated address, which opened the 
era of I-E studies. 

The fact that a few scholars had discovered the linguistic relationships of 
the Uralic languages by the close of the eighteenth century does not mean 
that this knowledge was either widespread or widely believed. Hungarian 
intellectuals of the time, for instance, fancied that the Turkic tribes in the 
east were their nearest relatives and had little use for rumors of similarities 
with Baltic or Siberian languages. In Hungary, at least, the wild unfettered 
Romanticism of the epoch chose to ignore the sober linguistic facts. 

Despite the hostile climate, however, іп 1770 the Hungarian Jesuit J. 
Sajnovics attempted to show, by comparing inflectional endings and lexical 
similarities, that Hungarian and Lapp are related. Since the Saami are ra- 
cially distinct from the Hungarians (and from the Baltic peoples as well), 
and since they occupy what would today be called a modest position on the 
socioeconomic scale, many Hungarians were outraged at Sajnovics's pro- 
posal, which was condemned "as an impious act of sacrilege and regarded 
by a so-called ‘patriotic public’ as a slight on the nation's honour. Preference 
was given to authors less inclined to dissipate illusions." (Zsirai 1951: vi). 

In 1799 another Hungarian, Sámuel Gyarmathi, who had been both in- 
fluenced and encouraged by the German Schlózer, published the most sig- 
nificant work on Uralic languages up to that time. Gyarmathi's work repre- 
sents an advance over Sajnovics' s, in that he took into consideration all of 
the Uralic languages about which he had information (including Samoyed) 
and did not artificially restrict his investigation to only two languages. 
Gyarmathi established that (1) Hungarian's closest relatives are Xanty and 
Mansi, the three of which form a Ugric subfamily; (2) this Ugric group 
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shows numerous affinities with the Finnic family; and (3) both display simi- 
larities with the Samoyed languages. Although many of Gyarmathi's pro- 
posed etymologies were later discredited, it nevertheless remains true that 
"a substantial proportion of the accepted . . . Finno-Ugrian etymologies 
was established for the first time by Gyarmathi" (Zsirai 1951: xiv). 

At the beginning of the nineteenth century, the classification of Uralic 
languages was much better understood than that of the I-E family. Further- 
more, Uralic relationships had been established by the same methods later 
employed by the early Indo-Europeanists, namely, the discovery of lexical 
similarities (Strahlenberg) and inflectional similarities (Sajnovics, Gyar- 
mathi). Why, then, was comparative linguistics in the nineteenth century 
developed almost entirely in terms of the I-E family, with the Uralic heri- 
tage being largely ignored? Pedersen (1931: 241) suggests two principal rea- 
sons: "The Finno-Ugric languages were too far beyond the horizon and in- 
terest of most European scholars, and, besides, the problems to be solved 
were too difficult because the languages are very distantly related, and lack 
old documents." I believe Pedersen's first cause, a lack of interest on the 
part of the European community, was the crucial factor. The Uralic lan- 
guages are in fact not as distantly related as Pedersen implies, and as we 
saw above, they had already been fairly accurately classified by the end of 
the eighteenth century. Their major handicap was that practically with- 
out exception the Uralic languages are "marginal," both geographically and 
politically. The sole exception was Hungarian, but in Hungary the intellec- 
tual climate was hostile to the development of Finno-Ugric comparative 
linguistics. | 

During {һе nineteenth century there were по major revisions іп Uralic 
classification, though some details were clarified and а growing body of evi- 
dence supported the classifications of earlier scholars. In 1800 the Samoyed 
languages were the least wel] known, and their precise position within 
the Uralic family had not yet been firmly established. Although the first 
Samoyed grammar appeared in 1811, the Samoyed languages remained 
poorly documented until the work of the Finnish linguist M. A. Castrén. 
Castrén visited all the Samoyed-speaking areas during a series of field trips 
in the 18405 and classified the various Samoyed dialects into five languages, 
divided into Northern and Southern subgroups. In addition to clarifying 
the internal relationships of the Samoyed family, Castrén was the first 
scholar to show that the fundamental division in Uralic is between the 
Samoyed languages and all the rest of the family, i.e. Finno-Ugric. Because 
of this discovery Castrén is sometimes referred to as the founder of Uralic 
linguistics. His most important work, a comparative grammar of the 
Samoyed languages, appeared in 1854, two years after his death. 

In the final decades of the nineteenth century, two additional scholars 
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consolidated and expanded the knowledge of the Uralic family. Тһе first of 
these was а German, József Budenz, who spent the last 20 years of his Ше 
as the leading Finno-Ugric specialist in Hungary. In his major work, a com- 
parative dictionary of the Finno-Ugric elements in Hungarian, Budenz pro- 
posed close to 1,000 correspondences between Hungarian and various Fin- 
nic languages and laid to rest forever the erroneous notions concerning 
Hungarian's closest relatives. Though Budenz concentrated primarily on 
Finno-Ugric languages, he did avail himself of the opportunity to elicit five 
stories and some sentences from a Samoyed family of five that was being 
erhibited at the Budapest zoo in 1882. 

А final scholar of importance was the Hungarian linguist Ignác Halasz. 
In a work published in 1893—94 Halász proposed 245 Uralic cognates and 
noted numerous grammatical agreements between the Samoyed and Finno- 
Ugric families. His work has been called “the first thorough proof of the 
Samoyed-Finno-Ugric relationship" (Hajdü 1963: 87). 


2.4.2 Present Status of Uralic-Yukaghir Classification 


Within the Uralic family the following groups of languages are believed 
to constitute valid genetic nodes: 


(12) SAMOYED FINNIC 
FINNO-UGRIC РЕКМІС 
UGRIC SAAMIC (- LAPPIC) 


OB-UGRIC BALTIC FINNIC 


Some linguists would add an additional group, Volgaic, made up of the two 
languages Mari and Mordvin (Austerlitz 1968, Sauvageot & Menges 1973, 
Haarmann 1974, Voegelin & Voegelin 1977). Other scholars, however, deny 
any special relationship between these two languages (Collinder 1965, 
Harms 1974). 


List 2.8. The Uralic Family (1986) 


URALIC: 
I SAMOYED: 
А NORTH 
B SOUTH 
П FINNO-UGRIC: 
А UGRIC: 
1 Hungarian 
2 ОВ-ОСКІС 
B FINNIC 
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List 2.9. Proposed Subgroupings of the Finnic Family 


Collinder 1965 Austerlitz 1968 Sauvageot & Menges 1973 
FINNIC: FINNIC: FINNIC: 
I Mordvin I PERMIC I BALTIC FINNIC 
II Mari II VOLGAIC II SAAMIC 
III BALTIC FINNIC Ш NORTH FINNIC: III VOLGAIC 
IV SAAMIC A. SAAMIC IV PERMIC 
V PERMIC B BALTIC FINNIC 
Harms 1974 Voegelin & Voegelin 1977 
FINNIC: FINNIC: 
I PERMIC I PERMIC 
II Mari II FINNO-VOLGAIC: 
III WEST FINNIC: A VOLGAIC 
A. Mordvin B FINNO-LAPPIC: 
B NORTH FINNIC: 1 SAAMIC 
1 SAAMIC 2 BALTIC FINNIC 


2 BALTIC FINNIC 


The subgrouping of the units in (12) is at the highest levels quite uncon- 
troversial, with practically all scholars supporting the subgrouping for 
Uralic in List 2.8. Divergence of opinion is presently restricted to the Finnic 
subfamily of Uralic. Here there is, first, disagreement over the validity of 
the Volgaic group, and second, different proposals for subgrouping the 
constituent branches of Finnic (Baltic Finnic, Saamic, Permic, and either 
Volgaic or Mari and Mordvin). List 2.9 shows five proposed subgroupings 
for the Finnic languages. The classification of Finnic adopted in this book 
follows Austerlitz (1968). 

The Yukaghir language is spoken in northeastern Siberia by several hun- 
dred people. Though it is not a Uralic language stricto sensu, it "is obviously 
related to the Uralic languages" (Collinder 1955: xii). Yukaghir is often con- 
sidered a language isolate, that is, a language not known to be related to 
any other language (see Voegelin & Voegelin 1977: 355). Yet the evidence 
offered in Collinder (1940, 1965) and Harms (1977) demonstrates beyond 
doubt the affinity of Yukaghir with the Uralic family. Collinder (1965: 30) 
summarizes the evidence as follows: 


The features common to Yukaghir and Uralic are so numerous and so characteristic 
that they must be remainders of a primordial unity. The case system of Yukaghir is 
almost identical with that of Northern Samoyed. The imperative of the verbs is 
formed with the same suffixes as in Southern Samoyed and the most conservative of 
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the Fenno-Ugric languages. . . . There are striking common traits in verb deriva- 
tion. Most of the pronominal stems are more or less identical. Yukaghir has half a 
hundred words in common with Uralic, in addition to those that may fairly be sus- 
pected of being loanwords. . . . It is worth noting that all the Fenno-Ugric lan- 
guages deviate more from Samoyed in their case inflection than Yukaghir does. 


It thus appears that Yukaghir, and the closely related, but extinct, Chu- 
vantsy and Omok, are taxonomically coordinate with the entire Uralic fam- 
ily (see List 2.10). 


List 2.10. The Uralic-Yukaghir Family 
(Collinder 1965) 


URALIC-YUKAGHIR: 
I YUKAGHIR 
II URALIC: 
A SAMOYEDIC 
B FINNO-UGRIC: 
1 UGRIC 
2 FINNIC 
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2.5 CAUCASIAN 


Approximately three dozen Caucasian languages are spoken in the Cau- 
casus of the U.S.S.R. by some five million people. The geographic distri- 
bution of the Caucasian family is shown in Map 2.3. The Caucasian family 
is composed of two very different subfamilies, and it is possible that these 
two subfamilies, North and South Caucasian, do not constitute a valid ge- 
netic node. Georgian, with 3.3 million speakers, is the only well-known 
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Caucasian language. One North Caucasian language, Kabardian, has at- 
tained a certain linguistic notoriety as a language with few vowels and nu- 
merous consonants. 


2.5.1 History of Caucasian Classification 


I have been unable to determine precisely who it was that first recog- 
nized, and offered evidence for, the various genetic groups constituting the 
Caucasian family. As a consequence, the following brief historical perspec- 
tive will be a good deal less satisfying than I would like. It will consist 
mainly of an enumeration of scholars who have studied, and made some 
contribution to, Caucasian linguistics, but the contribution of each with re- 
gard to genetic classification will remain regrettably vague. 

The earliest mention of the multiplicity of languages spoken in the Cau- 
casus Mountains was made by the Greek geographer Strabo around the be- 
ginning of the Christian era. But these languages attracted very little schol- 
arly attention until the late eighteenth century, at which time the region 
was coming increasingly under Russian control. 

The first person to study the Caucasian languages in a systematic way 
was a German doctor, Johann Anton Güldenstádt. During the second half 
of the eighteenth century, Güldenstádt published information on the di- 
verse North Caucasian languages and proposed a classification of them that 
was not, according to Vogt (1942: 245), "very different from that currently 
accepted." 

The German linguist Heinrich Julius Klaproth continued Güldenstádt's 
work in the early nineteenth century, and a Frenchman, Marie-Félicité 
Brosset, undertook the study of Georgian, a South Caucasian language and 
the only Caucasian language with a lengthy literary history (dating from 
the fifth century A.D.). 

During the second half of the nineteenth century, Caucasian languages 
were studied with increasing frequency, and many previously undocu- 
mented languages received their first published treatments. In this regard 
one must mention the Russian orientalist Franz Anton Schiefner, who 
from 1854 to 1873 published descriptions of eight Caucasian languages. 
During this same period the Russian officer Peter Karlovich Uslar, who had 
earlier taken part in the military campaigns in the Dagestan region, devoted 
himself to the study of Caucasian languages, producing grammatical de- 
scriptions of six previously unknown languages. Uslar is sometimes re- 
garded as the "father and founder of Caucasian linguistics" (Vogt 1942: 
246); according to Sala and Vintilá-Rádulescu (1981: 54), he was the first to 
propose that all Caucasian languages share a common origin. 
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Тһе final work of the nineteenth century оп Caucasian languages was 
published in 1895 by another former officer in the Russian army, R. von 
Eckhart. This general survey of Caucasian languages contained information 
on a greater number of Caucasian languages than earlier works and was 
guite successful in Europe even though it was not highly regarded by 
scholars. 

The most important scholar of the early twentieth century was the Ger- 
man Adolf Dirr, who from 1904 to 1913 published monographs on nine 
North Caucasian languages, mostly of the Dagestan region. In 1924 Dirr 
founded the first linguistics journal devoted to Caucasian languages, Cau- 
casia, and in 1928 he published an introduction to Caucasian linguistics 
containing sketches of 35 languages. 


2.5.2 Present Status of Caucasian Classification 


Whether or not all Caucasian languages derive from a single source has 
never been resolved to the satisfaction of most linguists. There is no gues- 
tion that the South Caucasian, or Kartvelian, languages form a valid genetic 
group. Furthermore, most linguists accept that the North Caucasian lan- 
guages constitute a valid family with two branches, a compact North- 
western group and a distantly related, and more heterogeneous, North- 
eastern group. The principal takonomic dispute in the twentieth century 
has been whether the North Caucasian and South Caucasian families share 
a common ancestor. Although certain scholars have supported a single 
Caucasian family (e.g. Friedrich Müller, А. Trombetti, Hans Vogt, and most 
Soviet linguists), others remain unconvinced. The Russian linguist N. S. 
Trubetzkoj (1922: 185) considered the attempt to connect North and South 
Caucasian to be “fantasy” so long as regular sound correspondences had 
not been established between the two families. 

A second area of disagreement has to do with the relationship of the Nax 
group to other Northern subgroups. There have been three proposals: 
(1) the Nax group is intermediate between Northeast and Northwest Cau- 
casian and thus constitutes one of three primary branches of North Cauca- 
sian (Catford 1977, B.G. Hewitt in Comrie 1981); (2) the Nax group forms 
one of two primary branches of Northeast Caucasian (Eckhart 1895, Dirr 
1928, Dumezil 1952, Klimov 1965, Gamkrelidze & Gudava 1974), the other 
branch containing the remainder of the Northeastern languages; and (3) the 
Nax group is simply one of several Northeastern subgroups (Trubetzkoj 
1922, Deeters 1963, Voegelin & Voegelin 1977). These three subgroupings 
are illustrated in List 2.11. The classification adopted in this book follows 
subgrouping 2. 
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List 2.11. Proposed Subgroupings of the North Caucasian Languages 


I П ILI 
NORTH CAUCASIAN: NORTH CAUCASIAN: NORTH CAUCASIAN: 
I NORTHWEST І NORTHWEST І NORTHWEST 
II NAK II NORTHEAST: H NORTHEAST: 
HI NORTHEAST A NAX A NAX 
B DAGESTAN: B AVARO-ANDI- 
1 AVARO-ANDI-DIDO DIDO 
2 LAK-DARGWA C LAK-DARGWA 
3 LEZGIAN D LEZGIAN 


Ignoring low level genetic units whose members are sometimes treated 
as dialects of one language, other times as distinct languages (e.g. Mingre- 
lian-Laz, Abxaz-Abaza, Chechen-Ingush, Bezhta-Hunzib), the following 
Caucasian groups are fairly well established as valid genetic units: 


(13) SOUTH CAUCASIAN AVARO-ANDI-DIDO 
NORTH CAUCASIAN ANDI 
NORTHWEST CAUCASIAN DIDO 
NORTHEAST CAUCASIAN |. LAK-DARGWA 
NAX LEZGIAN 


Dagestan may be added to this list only if subgrouping 2 in List 2.11 above 
is correct. The Caucasian classification used in this book has the structure 
given in List 2.12. 

Several scholars have attempted to relate the language isolate Basque, 
spoken in northern Spain, with Caucasian languages. The putative rela- 


List 2.12. The Caucasian Family 
(Gamkrelidze & Gudava 1974) 


CAUCASIAN: 


A NORTHWEST 
B NORTHEAST: 
1 NAX 
2 DAGESTAN: 
a AVARO-ANDI-DIDO 
b LAK-DARGWA 
c LEZGIAN 
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tionship would Ре а very distant опе, апа the majority of scholars remain 
skeptical. For evidence supporting the connection, the reader should con- 
sult Bouda (1948, 1949), Lafon (1933, 1952b), and Michelena (1968). 
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Э 
Africa 


There are many more languages іп Africa than іп Europe, and their classifi- 
cation has been comprehensively worked out only in this century. The 
three European phyla discussed in the previous chapter contain some 200 
languages, almost half of which are really spoken in Asia (primarily the 
Indo-Iranian branch of Indo-European). The four phyla into which African 
languages are currently classified comprise close to 1,500 languages. Lan- 
guages of the Afro-Asiatic family are spoken across a wide belt covering 
most of the northern third of Africa and ertending into western Asia. More 
than 1,000 Niger-Kordofanian languages are spoken throughout most of the 
southern two-thirds of Africa, with the large Bantu subfamily occupying 
most of the southern half of the continent. Although one Nilo-Saharan lan- 
guage (Songhai) is spoken in western Africa (i.e. Mali, Burkina Fasso, 
Niger), most Nilo-Saharan languages are found in central and east-central 
Africa. Khoisan languages were probably originally spoken over most of the 
southern third of Africa. However, the Bantu expansion from the north- 
west and the European occupation of the South have overwhelmed and ex- 
tinguished most of these languages. Several dozen Khoisan languages are 
still spoken by small groups in South Africa, Namibia, Botswana, and An- 
gola, where the dominant languages are either Bantu (e.g. Zulu, Xhosa) or 
Indo-European (e.g. Afrikaans, English). The two most divergent Khoisan 
languages are in fact found in northern Tanzania. 


3.1 THE HISTORY OF AFRICAN CLASSIFICATION 


Our earliest records of Sub-Saharan languages are Arabic documents 
dating from the tenth to twelfth centuries. Although a few wordlists of Af- 
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rican languages were collected during the Middle Ages, it was not until the 
fifteenth and sixteenth centuries that Europeans began to gather such in- 
formation in any quantity. The seventeenth century witnessed a flowering 
of scholarship on African languages, including dictionaries of Coptic (1636), 
Nubian (1638), and Kongo (1652) and grammars of Nama (1643),* Kongo 
(1659), Geez (1661), and Amharic (1698), among others. 

During most of the eighteenth century African scholarship ebbed, but in 
the last quarter of the century, two African language families were clearly 
discerned. In 1776 a French abbé, Lievin Bonaventure Proyart, recognized 
that three Bantu languages must derive from a common source, which, 
however, he incorrectly assumed still existed. It is likely that the En- 
glishman William Marsden perceived the Bantu group as early as 1778, 
though the first written record to this effect dates from 1816. In the early 
nineteenth century several more scholars independently discovered the 
Bantu group. The early recognition of Bantu is not surprising, since Bantu 
is a large group of closely related languages that covers most of the south- 
ern third of Africa. Because the Bantu family is about as closely related as 
the Romance group (cf. Table 1.2), it can hardly be missed by anyone who 
examines even a few Bantu languages. (The term Bantu, however, was not 
coined until 1858.) 

The other linguistic group to be recognized in the eighteenth century 
was the Semitic family. The German scholar Ludwig von Schlózer is often 
credited with having recognized, and named, the Semitic family in 1781. 
But the affinity of Hebrew, Arabic, and Aramaic had been recognized for 
centuries by Jewish and Islamic scholars, and this knowledge was pub- 
lished in Western Europe as early as 1538 (see Postel 1538). Around 1700 
Hiob Ludolf, who had written grammars of Geez and Amharic (both Ethi- 
opic Semitic languages) in the seventeenth century, recognized the exten- 
sion of the Semitic family into East Africa. Thus when von Schlózer named 
the family in 1781, he was merely recognizing genetic relationships that had 
been known for centuries. Three Semitic languages (Aramaic, Arabic, and 
Hebrew) were long familiar to Europeans both because of their geographic 
proximity and because the Bible was written in Hebrew and Aramaic. 

During the first half of the nineteenth century, two additional families 
were postulated. In 1808 Heinrich Lichtenstein divided the languages of 
Southern Africa into two groups, Bantu and Nama. The first attempt at an 
overall classification of African languages was published in 1826 by Adrien 
Balbi. Balbi connected Nama with the Bushman languages, as did James C. 
Prichard (1826). The principal distinction between the Nama and the Bush- 
men is cultural, not linguistic. The Nama are cattle-herders; the Bushmen 
are hunters and gatherers. Both groups, however, share a distinct racial 


“Тһе name Hottentot is felt to be pejorative and has therefore been replaced by Nama 
throughout this work. 
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type, sometimes called Bushmanoid, which distinguishes them rather 
clearly from the surrounding Negroid Bantu speakers. Furthermore, the 
languages of the Nama and Bushmen both use clicks. It is likely that such 
genetically (in the linguistic sense) irrelevant traits as racial type and clicks 
were at first the major criteria for connecting Nama and the Bushman lan- 
guages. The connection turned out to be correct, but the supporting lin- 
guistic evidence did not come untillater. 

Also widely recognized by 1850 was a group of languages known as 
Hamitic, which was usually taken to include Ancient Egyptian (now ex- 
tinct) and the Berber and Cushitic families. Furthermore, most scholars ad- 
mitted by then that the Hamitic languages showed such striking affinities 
with the Semitic family that a common origin could hardly be doubted. The 
overall family became known in the 1860's as Hamito-Semitic. 

Thus at midcentury there were three recognized language families in Af- 
rica: Bantu, Nama-Bushman, and Hamito-Semitic. Unfortunately there 
were still hundreds of languages that did not appear to fit into any of these 
families. This rather substantial residue was more often than not lumped 
into a single ill-defined group with a name like “Negroland” or “Mid- 
African.” Speakers of these languages were for the most part Negroid and 
were situated geographically in the middle of Africa, in a belt running from 
coast to coast, with Hamito-Semitic speakers to the north and Bantu and 
Nama-Bushman speakers to the south. Relationships among these numer- 
ous and heterogeneous languages were not at all clear. Some scholars сап- 
didly admitted that the Negro group was based primarily on physical traits 
rather than linguistic similarities. In revising Prichard's Natural History of 
Man (1843), the Englishman Edwin Norris wrote in 1855 that “hitherto the 
Negroes have been deemed to be one race from physiological rather than 
philological evidence” (quoted in Cole 1971: 21). 

During the second half of the nineteenth century, two influential classifi- 
cations of African languages appeared, both first published in the 1860's 
and revised periodically by their authors over the next several decades. The 
German Egyptologist Karl Richard Lepsius published various versions of 
his classification between 1863 and 1880. His final classification, from his 
Nubische Grammatik (Nubian Grammar), is given in List 3.1. 

The other important classification was the work of an Austrian linguist, 
Friedrich Müller, who offered a complete classification of the world's 
languages in his Grundriss der Sprachwissenschaft (Outline of Linguistics). 
Müller's classification appeared first in 1867, with revised versions as late as 
1888. The African portion of his classification had six subdivisions (see List 
3.2). Of these six groups, Müller believed that only Semitic and Bantu were 
valid genetic units, with the status of the others in doubt. Müller acknowl- 
edged that Semitic and Hamitic were related at some level, but the precise 
relationship remained unclear. 
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List 3.1. African Language Families List 3.2. African Language Families 
(Lepsius 1880) (Muller 1877) 
SEMITIC SEMITIC 
HAMITIC: НАМІТІС: 
I TAncient Egyptian I TAncient Egyptian 
H CUSHITIC П CUSHITIC 
III BERBER-HAUSA HI BERBER 
I BANTU 
П MIXED NEGRO (all the BANTU 
rest) NAMA-BUSHMAN 


The chief differences between Lepsius and Müller were the following. 
(1) To the traditional Hamitic group (see List 3.2) Lepsius added other 
languages not usually considered Hamitic: Hausa (which he linked with 
Berber) and Nama. (2) Lepsius lumped the Negro and Bantu groups into 
one family, whereas Müller kept them separate. (3) From the Negro family 
Müller extracted a heterogeneous Nuba-Fula group, which subsequent 
scholarship proved to be totally unjustified. Both Lepsius and Müller used 
nonlinguistic criteria, such as skin color, hair type, and mode of subsis- 
tence, in establishing their classifications. Lepsius, in addition, used ty- 
pological traits, such as the presence or absence of gender, which do not 
lead to valid genetic groups. 

During the first half of the twentieth century, two Germans dominated 
work on African linguistics, particularly in matters of classification. The 
first was the Bantuist Carl Meinhof; the second was his student Diedrich 
Westermann. Unlike their nineteenth-century predecessors, neither ever 
proposed a complete classification of African languages, but their views on 
most of the vital issues in language grouping can be inferred from their 
published works. 

Meinhof made his reputation in Bantu linguistics and then used that au- 
thority to dominate the entire African field for close to half a century. In 
1910 he proposed that the ill-defined Mid-African, or Negro, group be re- 
named Sudanic, and he expressed the belief that all these languages would 
one day be shown to be related. He did not claim to have demonstrated the 
relationships himself. The following year Westermann attempted to prove 
the genetic unity of Meinhof's Sudanic group, which at this point was de- 
fined mostly in negative terms: it included all those languages that were 
neither Semitic, Hamitic, Bantu, nor Nama-Bushman. In Die Sudansprachen 
(The Sudanic Languages; 1911), Westermann compared five West African 
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languages with three East African languages. Although he felt he had 
accomplished the task assigned him by Meinhof, he was aware that the 
evidence linking the Eastern and Western languages was not altogether 
satisfactory. 

In 1912 Meinhof published Die Sprachen der Hamiten (The Languages of 
the Hamites) in which he ertended Lepsius's ехрапвіоп of the Hamitic 
group. We may recall that to the traditional Hamitic group (Ancient Egyp- 
tian, Berber, Cushitic), Lepsius had added Nama and Hausa. Meinhof now 
proposed Hamitic affiliation for Fula, Maasai, Bari, Nandi, Sandawe, and 
Hadza. Thus for Meinhof African languages fell into one of three groups, 
Bantu, Sudanic, or Hamito-Semitic (see List 3.3). Despite objections from 
various critics with regard to both the Hamitic and the Sudanic group, 
Meinhof's classification remained in fashion through the 19405. 

Westermann, however, had increasing doubts about certain aspects of 
Meinhof's classification. The Sudanic group, in particular, seemed less and 
less secure. In 1927, when he published his best work, “Die westlichen 
Sudansprachen und ihre Beziehungen zum Bantu” (The West Sudanic Lan- 
guages and Their Relationship to Bantu), Westermann left aside the dispa- 
rate East Sudanic languages altogether and concentrated instead on the 
relatively more clear-cut West Sudanic family. In this work Westermann es- 
tablished the genetic unity of the West Sudanic family and offered the most 
accurate subgrouping yet proposed for West Sudanic. In addition, he cited 
sufficient similarities between West Sudanic and Bantu that some kind of 
genetic relationship was apparent, though he declined to be specific about 
its exact nature. 

Neither of these ideas was original with Westermann; numerous scholars 
had noticed scattered resemblances among West African languages, and 
between West African languages and Bantu, from the mid-nineteenth cen- 
tury on. Westermann, however, marshaled the evidence in such detail that 
both conclusions became inescapable (though Westermann himself was 
noncommittal on both, perhaps out of deference to his mentor Meinhof). 
In 1940 Westermann suggested the classification of African languages given 
in List 3.4, but he made it clear that not all the groups were necessarily valid 
genetic units. 

In 1950 African linguistic classification was, in its broadest outline, 
not significantly different from views prevailing a century earlier. To be 
sure, there had been progress in delineating low-level genetic groups (e.g. 
Chadic, Nubian), but there had been remarkably little success іп establish- 
ing how the numerous small families fit together in larger groupings. At 
this point the American linguist Joseph H. Greenberg proposed, in a series 
of articles (Greenberg 1949—50, 1954; published together as Greenberg 
1955), a complete classification of African languages that differed in certain 
dramatic ways from prevailing opinion. In a classification that Greenberg 
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List 3.3. African Language Families List 3.4. African Language Families 
(Meinhof c. 1912) (Westermann 1940) 
BANTU KHOISAN: 
I NAMA 
SUDANIC: 
I EAST II BUSHMAN 
II WEST NEGRO: 
HAMITO-SEMITIC: АТЫП 
II BANTU 
I SEMITIC 
III SUDANIC: 
H HAMITIC: 
4 : A NIGRITIC 
А tAncient Egyptian 
B MANDE 

B CUSHITIC 

C BERBER C SEMI-BANTU 

DH D INNER SUDAN 

ausa 
E Maasai HAMITO-SEMITIC: 
F Fula I SEMITIC 


G Nama II HAMITIC 


termed "conservative," the languages of Africa were divided into 16 groups, 
the genetic unity of each being, according to Greenberg, "in the class of the 
obvious." That classification is shown in List 3.5. Greenberg's classification, 
which was hailed by Welmers (1956: 558) as "a significant mid-century 
milestone . . . almost sensational in its scope and originality" and char- 
acterized by Cole (1960: 223) as "the first attempt at a pan-African clas- 
sification on genetic principles," was revolutionary in several important 
respects. 

As Cole emphasized, Greenberg’s classification was based solely on ge- 
netic principles. Hitherto all attempts at pan-African classifications had fol- 
lowed genetic principles only in identifying such obvious groups as Bantu 
and Semitic. When confronted with a bewildering diversity in the remain- 
ing hundreds of African languages, all earlier classifiers had abandoned ge- 


List 3.5. African Language Families (Greenberg 1949—50) 


1. KHOISAN 8. MABAN 

2. AFRO-ASIATIC (formerly 9. TEMEIN 
Hamito-Semitic) 10. KOMAN 

3. NIGER-CONGO (formerly 11. Songhai 
West Sudanic) 12. Mimi 

4. KORDOFANIAN 13. Fur 

5. CENTRAL SAHARAN 14. Berta 

6. CENTRAL SUDANIC 15. Kunama 

7. EASTERN SUDANIC 16. Teuso 
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пенс principles, ав articulated Бу Indo-Europeanists іп the nineteenth сеп- 
tury (see section 2.3), and had resorted either to nonlinguistic factors such 
as skin color, hair type, and mode of existence or to typological traits like 
gender. By the end of the nineteenth century, it was well known that the 
use of such criteria does not produce valid genetic groups. Why, then, were 
they used for African languages? I have already suggested one reason: the 
traditional linguistic criteria did not seem to work for most African lan- 
guages. But there was a second factor, which was equally important. This 
was the widespread belief among European scholars that European lan- 
guages were "special," and that, though the principles of Indo-European 
comparative grammar might be valid in theory for the rest of the world's 
languages, in the real world the lack of ancient documents for almost all 
African languages, the (supposed) primitive nature of the African peoples, 
and so forth, all conspired to make genetic classification intractable, except 
for many low-level groups. 

First and foremost, Greenberg reaffirmed the irrelevance of nonlinguistic 
and typological traits for the genetic classification of languages. АП 16 of 
his groups were based on the same linguistic principles, and they were all 
claimed to be valid genetic groups. Physical and cultural attributes of speak- 
ers were for the first time ignored. Second, he demonstrated that tradi- 
tional comparative linguistics may be used successfully in any part of the 
world, regardless of the technological level of the people or the literary his- 
tory of their languages. Greenberg’s reclassification of African languages 
had three important results: 

1. The Hamitic group was eliminated. Greenberg showed that each of 
the three original members of Hamitic (Ancient Egyptian, Berber, Cushitic) 
constitutes an independent branch of Hamito-Semitic, and that the Chadic 
family (including Hausa) constitutes a fifth branch. Since there was no valid 
Hamitic group, and since the term Hamitic had racial (and racist) connota- 
tions, Greenberg proposed that the family name Hamito-Semitic be re- 
placed by Afro-Asiatic, the only language family spoken in both Africa and 
Asia (see List 3.6). 

2. The other languages that had been added to Hamitic by Lepsius and 
Meinhof were shown to belong to various groups: (a) Fulani was included 
in the West Atlantic family of Niger-Congo, as several French scholars had 
already suggested; (b) Nama was placed in the Khoisan group; (c) the 
so-called Nilo-Hamitic languages (e.g. Maasai, Nandi, Bari) formed one 
branch of the Nilotic family, which itself was one of seven branches of 
Greenberg's Eastern Sudanic group. 

3. Greenberg added the Adamawa and Ubangian families to Wester- 
mann's West Sudanic phylum and renamed it Niger-Congo. Bantu was 
shown to be a subgroup of a subgroup of Niger-Congo, as in List 3.7. 
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List 3.6. Тһе Afro-Asiatic Family 
(Greenberg 1950) 


AFRO-ASIATIC (formerly 
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List 3.7. Тһе Niger-Congo Family 
(Greenberg 1954) 


NIGER-CONGO (formerly West 


Hamito-Semitic): Sudanic): 

I TAncient Egyptian I WEST ATLANTIC 
JI SEMITIC II MANDE 
Ш BERBER III GUR (= VOLTAIC) 
IV CHADIC IV KWA 

V CUSHITIC V yo 


VI ADAMAWA-UBANGIAN 
VII BENUE-CONGO: 
A BANTU 


Greenberg did not deny the possibility of further consolidation of the 16 
groups in List 3.5 as more and better information became available, and im 
fact suggested that similarities between Niger-Congo апа Kordofanian, 
and between Mimi and Maban, might ultimately lead to such larger group- 
ings. In 1954 he published an update to his classification in which the 16 
original groups were reduced to 12 by the addition of Mimi to the Maban 
group and the grouping of Berta, Kunama, Eastern Sudanic, and Central 
Sudanic in a higher-level node, which he called Macro-Sudanic (see List 
3.8). 

In 1963 Greenberg published the definitive version of his classification, 
in which all African languages were shown to belong to one of four phyla. 
The major differences between this work and his previous classification 
were (1) the grouping of Kordofanian with Niger-Congo to form a family 
called Niger-Kordofanian, and (2) the grouping of all non-Khoisan, non- 
Afro-Asiatic, non-Niger-Kordofanian languages (i.e. branches 5-16 in List 
3.5) in a fourth phylum, for which he proposed the name Nilo-Saharan. 
This highly diversified family corresponds very roughly with what had 
been known as the East Sudanic languages in the earlier literature. The 


List 3.8. The Macro-Sudanic Family 
(Greenberg 1954) 


MACRO-SUDANIC: 
I Berta 
П CENTRAL SUDANIC 
HI EASTERN SUDANIC-KUNAMA: 
A Kunama 
B EASTERN SUDANIC 
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Macro-Sudanic family, which Greenberg renamed Chari-Nile, was consid- 
ered to be one of six primary branches of Nilo-Saharan. 

Greenberg's 1963 classification has formed the basis of all subsequent 
work on African classification. By eliminating the irrelevant nongenetic cri- 
teria that had vitiated all previous classifications, Greenberg demonstrated 
that the principles of genetic classification established in the nineteenth 
century for European languages were equally valid for African languages. 
It was, one might say, a demonstration of linguistic uniformitarianism that 
put African linguistic classification back on track after a century of relative 
stagnation, and in some instances even regression (e.g. the ill-conceived 
Hamitic group). Greenberg's 1963 classification is shown in List 3.9. 


List 3.9. African Language Phyla (Greenberg 1963) 
AFRO-ASIATIC: 



















NILO-SAHARAN: 


I tAncient Egyptian I Songhai 
II SEMITIC П SAHARAN 
III BERBER III MABAN 
IV CHADIC IV Fur 
V CUSHITIC: V CHARI-NILE: 
A NORTHERN A EASTERN SUDANIC 
B CENTRAL B CENTRAL SUDANIC 
C EASTERN C Berta 
D WESTERN D Kunama 
E SOUTHERN VI KOMAN 
NIGER-KORDOFANIAN: KHOISAN: 
I KORDOFANIAN I Hadza 
II NIGER-CONGO: II Sandawe 
A WEST ATLANTIC Ш SOUTHERN AFRICA: 
B MANDE A NORTHERN 
C GUR (= VOLTAIC) B CENTRAL 
D KWA C SOUTHERN 


E BENUE-CONGO 
F ADAMAWA-UBANGIAN 
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3.2  AFRO-ASIATIC 


Roughly 240 Afro-Asiatic languages are spoken across the northern third 
of Africa, from Morocco and Mauritania on the Atlantic seaboard to Egypt, 
the Sudan, Ethiopia, and Somalia on the east coast. In addition, languages 
of the Semitic branch are spoken in many countries of the Near East (e.g. 
Saudi Arabia, Jordan, Israel, Lebanon, Syria, Iraq). This geographic distri- 
bution is shown in Map 3.1. 
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Map 3.1. The Afro-Asiatic family. Arabic is now spoken in the area where Old Egyptian 
once prevailed. 


Afro-Asiatic languages are spoken by an estimated 175 million people, 
but only a few languages, with numerous speakers, account for most of the 
total. Arabic alone has some 100 million speakers. The next most impor- 
tant, in sheer numbers, are Hausa (with 12 million native speakers and an 
additional 13 million who speak it as a second language), Amharic (8 mil- 
lion), Oromo (7 million), Tigrinya (4 million), Shilha (3 million), Hebrew (3 
million), Tamazight (2 million), Somali (2 million), Kabyle (1 million), and 
Riff (1 million). 

The Afro-Asiatic phylum has six distinct branches. Ancient Egyptian, 
which was known in its last years as Coptic, became extinct in the seven- 
teenth century. It was spoken in the region that is today Egypt. Most of the 
11 million Berber speakers are found in Morocco and Algeria, with roughly 
five million each. Berber languages are also found in small numbers in 
Tunisia, Libya, Mauritania, and Senegal. The widely dispersed Tuareg are 
found in Niger, Mali, and Nigeria. Originally, Berber languages were spoken 
over most of northern Africa, but the Arab expansion from the Arabian 
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Peninsula, beginning іп the seventh century, has largely submerged the 
original Berber tongues; most Berber speakers (especially men) are today 
bilingual in Arabic. The Chadic family, with about 125 languages and some 
30 million speakers, is located primarily in Chad, Niger, Nigeria, Came- 
roon, and the Central African Republic. A single language, Hausa, ac- 
counts for about 80 percent of Chadic speakers. There are a little more than 
a million Omotic speakers, mostly in western Ethiopia and northern Kenya, 
speaking some three dozen languages. About 35 Cushitic languages, with 
12 million speakers, are found in the Sudan, Ethiopia, Kenya, Somalia, and 
Tanzania. Semitic languages are spoken in much of northern Africa and the 
Near East. Altogether there are some 20 languages and 120 million speak- 
ers, though Arabic, with 100 million speakers, accounts for most of this 
total. Most Semitic languages are found in Ethiopia. 


3.2.1 History of Afro-Asiatic Classification 


For cultural and geographic reasons the Semitic family was the first 
branch of Afro-Asiatic (indeed the first African family) to be recognized 
as a genetic group of languages by European scholars. A Frenchman, 
Guillaume Postel, reported the affinity of Hebrew, Arabic, and Aramaic in 
1538, a connection long known to Jewish and Islamic scholars. In 1702 Hiob 
Ludolf extended this Semitic nucleus to include the Ethiopic Semitic lan- 
guages of East Africa, and finally im 1781 von Schlózer proposed the name 
Semitic. 

During the first half of the nineteenth century, Ancient Egyptian and the 
Berber апа Cushitic families became associated in a group called Hamitic, 
the defining characteristic of which was that the Hamitic languages all 
showed resemblances to the Semitic family. The Berber scholar Francis W. 
Newman sought to establish the relationship of Hausa to the Hamito- 
Semitic group in 1844, an affinity accepted by Max Müller in the 18505 and 
Karl Lepsius in the 18605. Furthermore, it soon became apparent that 
Hausa was related to other languages in a family now known as Chadic. 
H. Barth established the affinity of Hausa and Logone in 1862, and in 1884 
G. A. Krause added Musgu, Bade, Margi, and others to the group. Thus by 
the late nineteenth century a fairly coherent picture of the Hamito-Semitic 
family had emerged, as shown in List 3.10. Lepsius, as we saw in the pre- 
ceding section, incorrectly included Nama in the Hamitic group, but most 
scholars did not follow him on this point. 

Despite the fairly clear evidence of the Hamito-Semitic affinity of Hausa 
(and hence the whole Chadic family), many scholars continued to express 
reservations, or even outright denial, until the mid-twentieth century. 
Greenberg's 1950 classification effectively ended the century-old debate. 
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List 3.10. The Hamito-Semitic Family 
(late 19th century) 


HAMITO-SEMITIC: 
I SEMITIC 
II HAMITIC: 
А tAncient Egyptian 
B BERBER 
C CUSHITIC 
D CHADIC 


Why should scholars have persisted for so long in excluding Chadic lan- 
guages from Hamito-Semitic when the linguistic evidence makes it obvious 
that this is where they belong? The answer is that although Semitic, Cushitic, 
Berber, and Ancient Egyptian are (or were) spoken by people of a predomi- 
nantly Caucasoid racial type, speakers of Chadic languages are Negroid, 
and for this reason alone they were segregated from Hamito-Semitic and 
incorrectly lumped with the other “Negro” languages. 

Most nineteenth-century scholars (and a good many in the twentieth 
century as well) suffered from an ethnocentrism that often included out- 
right racism. For scholars who fervently believed in the superiority of the 
“white race,” white languages, and the like, the idea of a black branch 
(Chadic) in a white family (Hamito-Semitic) was disconcerting, if not re- 
pugnant. Despite the fact that a portion of the scholarly community had 
been continuously aware of the Hamito-Semitic affinity of Hausa and other 
Chadic languages from the mid-nineteenth century on, many scholars con- 
tinued either to reject or to ignore this clear linguistic fact until the mid- 
twentieth century. Two events finally brought about the acceptance of 
Chadic as a member of Hamito-Semitic. The first was the work of Johannes 
Lukas in the 19305, which established quite clearly the Chadic family 
(Lukas called it Chado-Hamitic); the second was Greenberg’s 1950 classifi- 
cation, which summarized previous arguments and adduced additional 
evidence for the inclusion of Chadic in Hamito-Semitic. 

In rejecting Hamitic as a valid genetic group, Greenberg was in effect 
rejecting what had become the favored subgrouping for the Hamito-Semitic 
family (see List 3.10 above). As with Chadic, so too with Hamitic were 
Greenberg’s proposals decisive, though not original. As early as 1860 C. 
Lottner had proposed four primary branches for Hamito-Semitic (see List 
3.11). Such a subgrouping was also advocated by several French linguists in 
the early twentieth century (Pierre Lacau, Maurice Delafosse, and Marcel 
Cohen), but it was Greenberg’s rejection of Hamitic that finally tipped the 
scales in its favor (with the inclusion of Chadic as a fifth branch). 
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List 3.11. The Hamito-Semitic Family List 3.12. The Afro-Asiatic Family 
(Lottner 1860) (Greenberg 1950) 
HAMITO-SEMITIC: AFRO-ASIATIC (formerly 
I TAncient Egyptian Hamito-Semitic): 
II SEMITIC I tAncient Egyptian 
HI BERBER П SEMITIC 
ГУ CUSHITIC Ш CUSHITIC 
т та LE с eS 25 ТҮ ВЕКВЕК 
V CHADIC 


What was original in Greenberg's treatment of Hamito-Semitic, which he 
renamed Afro-Asiatic (a term first used by Delafosse in 1914), was the com- 
bination of including Chadic and breaking up the fallacious Hamitic group 
into three independent branches, thus producing the subgrouping in List 
3.12. In proposing this subgrouping, Greenberg was not claiming that all 
five branches of Afro-Asiatic are in fact equidistant from each other, but 
rather was rectifying the error of grouping Ancient Egyptian, Cushitic, and 
Berber together in a family (Hamitic) for which there was no linguistic evi- 
dence. More recent proposals for Afro-Asiatic subgrouping will be dis- 
cussed below. 


3.2.2 Present Status of Afro-Asiatic Classification 


Although Afro-Asiatic has proved to be the least controversia] of Green- 
berg's four African phyla, a number of disputes and revisions have de- 
veloped in the 30 years since it was first proposed. The most significant 
revision involves the Cushitic family. In contrast to Berber and Semitic, 
which are very close-knit families whose internal unity cannot be ques- 
tioned, the constituent branches of Cushitic are quite disparate, and doubt 
has arisen that they constitute a valid genetic unit. In his 1950 classification 
Greenberg followed the Italian M. M. Moreno, who in the late 19305 had 
divided the Cushitic languages into four subgroups: Northern, Central, 
Eastern, and Western. In 1963 Greenberg added a fifth branch, Southern, 
which consists of severallanguages spoken to the south of the main body of 
Cushitic languages (see List 3.13). 

In 1969 ап American linguist, Harold C. Fleming, proposed that Western 
Cushitic is not a part of Cushitic at all, but rather constitutes a sixth pri- 
mary branch of Afro-Asiatic, for which he suggested the name Omotic. His 
proposal gained fairly wide support in the 19705. Recently the inclusion of 
the Northern branch (which is in fact a single language, Beja) in Cushitic 
has also been questioned by a number of scholars, including Fleming, 
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List 3.13. The Cushitic Family 
(Greenberg 1963) 


CUSHITIC: 

І МОКТНЕКМ 
П CENTRAL 
HI EASTERN 
IV WESTERN 
V SOUTHERN 


M. Lionel Bender, and Robert Hetzron. Hetzron (1980) has proposed a re- 
vised subgrouping for Cushitic (which for him excludes both Omotic and 
Beja) in which the Eastern group is divided into two distinct primary 
groups (Highland and Lowland), with Southern Cushitic being a sub- 
branch of the latter. The Cushitic classification adopted in this volume fol- 
lows suggestions from Christopher Ehret; Omotic is excluded, and Beja is 
treated as coordinate with the rest (i.e. Central, Eastern, Southern). 

Within Afro-Asiatic there is near unanimity that Ancient Egyptian, Se- 
mitic, Chadic, and Berber are valid genetic nodes, although there is dis- 
agreement over whether Cushitic constitutes one, two, or three nodes (de- 
pending on the status one grants Omotic and Beja). No clear subgrouping 
has yet emerged for the five to seven groups that make up Afro-Asiatic, 
though several tentative suggestions have recently been made. List 3.14 il- 
lustrates a sample of current views, ranging from the essentially agnostic 
position of Hetzron to the more highly structured proposals of Ehret, New- 
man, Fleming, and Bender. It is apparent that there is as yet little consensus 
on the internal relationships of Afro-Asiatic. 

Finally, let us turn briefly to the internal subgrouping of the various 
Afro-Asiatic subfamilies. Cushitic (discussed above) and Ancient Egyptian 
(a single language) require no further comment. Hetzron (1974) summa- 
rizes the history of Semitic classification, which was to a large extent estab- 
lished by German scholars in the late nineteenth and early twentieth cen- 
turies. The classification adopted in this volume follows Hetzron (1972, 
1974) in dividing the Semitic family into two primary subgroups: (1) an ex- 
tinct Eastern branch, which consisted of a single language, Akkadian, and 
(2) a Western branch with two subbranches, (a) Central and (b) South. The 
Central group has three components (Aramaic, Canaanite, Arabic), and the 
Southern group has two (South Arabian, Ethiopic) (see List 3.15). 

Because the Chadic family contains -a large number of poorly docu- 
mented languages, its internal relationships have only recently begun to be 
clarified. Early classifications were organized along geographic, rather than 


List 3.14. Proposed Subgroupings of the Afro-Asiatic Family 


Voegelin & Voegelin 1977 Ehret 1979 Newman 1980* 
AFRO-ASIATIC: AFRO-ASIATIC: AFRO-ASIATIC: 
I TAncient Egyptian I OMOTIC I BERBER-CHADIC: 
II SEMITIC II ERYTHRAIC: A BERBER 
III BERBER A CHADIC B CHADIC 
IV CHADIC B CUSHITIC II EGYPTIAN-SEMITIC: 
V CUSHITIC C NORTH AFRO-ASIATIC: А tAncient Egyptian 
VI ОМОТІС 1 ТАпсіепі Egyptian В SEMITIC 
2 BERBER-SEMITIC: III CUSHITIC: 
a BERBER A. Beja 
b SEMITIC B CUSHITIC PROPER 
Fleming 1981 Greenberg 1981 Bender 1981 
AFRO-ASIATIC: AFRO-ASIATIC: AFRO-ASIATIC: 
I OMOTIC: I OMOTIC: I NORTH: 
H AFRO-ASIATIC PROPER: II AFRO-ASIATIC PROPER: A tAncient 
A CUSHITIC A tAncient Egyptian Egyptian 
B BEJA-CHADIC-BERBER- B SEMITIC B SEMITIC 
ANCIENT EGYPTIAN: C BERBER C BERBER 
1 Beja D CHADIC II WEST (= CHADIC) 
2 CHADIC-BERBER: E CUSHITIC III Beja 
a CHADIC IV SOUTH: 
b BERBER A CUSHITIC 
3 tAncient Egyptian B OMOTIC 
C SEMITIC 
Hetzron 1982 
AFRO-ASIATIC: 
I tAncient Egyptian 
П SEMITIC 
III BERBER 
IV CHADIC 
V Beja 
VI CUSHITIC 
VII OMOTIC 


^ Newman excludes Omotic from Afro-Asiatic. 
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List 3.15. Тһе Semitic Family (Hetzron 1972, 1974) 










SEMITIC: 
I tEAST (Akkadian) ІШ WEST] 
II WEST: B SOUTH: 
А CENTRAL: 1 South Arabian 
1 Aramaic 2 ETHIOPIC: 
2 ARABO-CANAANITE: a NORTH 


a CANAANITE 
b ARABIC 


b SOUTH 


linguistic, lines. Greenberg (1963) divided the family into nine subgroups, 
without further specification of relationships. Newman and Ma (1966) 
distinguished two first-order groups, Biu-Mandara and Plateau-Sahel. Fur- 
ther research, however, led Newman (1977) to divide Plateau-Sahel into 
two first-order groups (i.e. East Chadic and West Chadic) coordinate with 
Biu-Mandara. In addition, Newman considers the small Masa group a pro- 
visional fourth branch, since it does not appear to share the defining char- 
acteristics of any of the other three groups. The Chadic classification pre- 
sented in this volume, as shown in List 3.16, is based on Newman (1977). 

No firm picture of the internal relationships of the Berber family has yet 
emerged. I have adopted the classification shown in List 3.17, which fol- 
lows in the main Militarev (1984). Some scholars doubt that Guanche (for- 
merly spoken in the Canary Islands) is in fact Berber; others (Hetzron 1982, 
Militarev 1984) consider it a primary subgroup of Berber opposed to all the 
rest of the family. 

In the Omotic family there is a clear-cut cleavage between a small South- 
ern branch and a larger Northern group. The Northern branch, in turn, 
consists of three subgroups: Dizoid, Mao, and Gonga-Gimojan. The classi- 
fication shown in List 3.18 follows Fleming's work. The Mao group repre- 
sents the most problematical area—some scholars are skeptical that the 
Mao group is really Omotic. 


List 3.16. The Chadic Family 
(Newman 1977) 


CHADIC: 

I MASA 

II EAST 
Ш BIU-MANDARA 
IV WEST 
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List 3.17. The Berber Family (Militarev 1984) 


BERBER: [III BERBER PROPER] 
I tGuanche C WESTERN: Zenaga 
П ТЕаві Numidian (= Old Libyan) D NORTHERN: 
III BERBER PROPER: 1 ATLAS: Shilha, 
A EASTERN: Siwa, Awjila, Tamazight 
Sokna 2 KABYLE: Kabyle 
B TUAREG 3 ZENATI 





List 3.18. The Omotic Family (Fleming 1976a,b, 1981) 








OMOTIC: 

І SOUTH [II NORTH] 

П NORTH: C GONGA- 
A DIZOID GIMOJAN: 
B MAO 1 GONGA 


2 GIMOJAN 
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3.3 NIGER-KORDOFANIAN 


The vast Niger-Kordofanian phylum covers most of the southern two- 
thirds of Africa, as seen in Map 3.2. The small Kordofanian group, with 
some 30 languages and 200,000 speakers, is one of two primary branches. 
Kordofanian languages are spoken in the Sudan, where they are isolated 
from the rest of the Niger-Kordofanian phylum by Afro-Asiatic and Nilo- 
Saharan languages. The other primary branch, Niger-Congo, contains more 
than 1,000 languages and has some 180 million speakers. 

The Bantu subgroup of Niger-Congo covers most of the southern half of 
Africa and itself accounts for close to 500 languages and probably more 
than 100 million speakers. Bantu languages are found in all African coun- 
tries that lie south of a line running from southeastern Nigeria to south- 
ern Kenya. Other important Niger-Congo subfamilies (e.g. West Atlantic, 
Mande, Gur [= Voltaic]) are located in West Africa in а belt running from 
Senegal to Cameroon. The Adamawa-Ubangian subfamily, however, lies 
north of the Bantu family in a belt running from eastern Nigeria into the 
Sudan. The numerically strongest Niger-Congo languages (with millions 
of speakers in parentheses) are Mandinka (3), Bambara (1.5), Mende (1), 
Dyula (1), Fula (= Fulani, 8), Wolof (2), More (= Mossi, 2), Цо (1), Akan 
(2), Ewe (1), Yoruba (8), Igbo (6), Efik (2), Tiv (1), and the Bantu languages 
Losengo, Kikuyu, Kamba, Sukuma, Swahili, Sango, Kongo, Luganda, 
Luyia, Rwanda, Rundi, Luba, Bemba, Nyanja, Makua, South Mbundu, 
Shona, Tswana, Sotho, Xhosa, Zulu, Tsonga, and Ronga, each of which 
has more than a million speakers. Swahili is used as a lingua franca along 
the east coast of Africa by upwards of 20 million people. 
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Map 3.2. The Niger-Kordofanian family 


3.3.1 History of Niger-Kordofanian Classification 


As we saw in section 3.1, the first part of the Niger-Kordofanian phylum 
to be recognized was the widespread and closely related Bantu family, 
whose unity was already apparent in the eighteenth century. During the 
second half of the nineteenth century, a number of different language fami- 
lies were established in West Africa. In 1854 a German missionary, Sigis- 
mund W. Koelle, published what is now recognized as a landmark in Af- 
rican linguistics: the Polyglotta Africana. This work contained comparative 
wordlists from 156 African languages, with all entries written in a standard 


3.3 Niger-Kordofanian 97 


phonetic alphabet developed by Lepsius. Koelle grouped the languages, оп 
the basis of perceived similarities, into 11 groups, with 40 languages left 
unclassified. Of the six branches making up Greenberg’s (1963a) Niger- 
Congo family (see List 3.19), allbut Adamawa-Ubangian were foreshadowed 
in the Polyglotta. 


List 3.19. The Niger-Congo Family 
(Greenberg 1963a) 


NIGER-CONGO: 
I WEST ATLANTIC 
II MANDE 
IH GUR (= VOLTAIC) 
IV KWA 
V BENUE-CONGO 
VI ADAMAWA-UBANGIAN 


Koelle's first group was West Atlantic (named by him). His second group 
contained Mande languages, though the name Mande was first applied to 
the group in 1867 by H. Steinthal. Koelle's third, fifth, and sixth groups 
contained languages belonging to the Kwa family (Kru, Ewe, Yoruba, Igbo, 
Edo, Nupe); the name Kwa was coined by G. A. Krause in 1885. The fourth 
group contained Gur (also known as Voltaic) languages. The term Gur was 
proposed for the group in 1885 by G. A. Krause; Voltaic was suggested by 
Delafosse in 1911. Both names are still widely used. Koelle's groups 8-11, 
which he distinguished from the other groups as "South African" lan- 
guages, contained various components of Benue-Congo, including Bantu. 
The name Benue-Congo was proposed by Greenberg in 1963 as.an exten- 
sion of Westermann's Benue-Cross group, from which the Bantu family had 
been excluded. The only Niger-Congo branch that was not distinguished in 
Koelle's work is the Adamawa-Ubangian family. Delafosse recognized and 
named the Ubangian family in 1924, but the Adamawa group went un- 
noticed until Greenberg (1950) joined it with the Ubangian family. 

Striking similarities among the various West African families, and be- 
tween these families and Bantu, were also noticed by numerous scholars 
during the last half of the nineteenth century. Bishop O. E. Vidal had recog- 
nized the affinity of certain languages of Sierra Leone with the Bantu family 
in 1852. A decade later Wilhelm Bleek asserted that a single family covered 
most of Southern and West Africa. Other scholars arrived independently at 
the same conclusion. It is thus clear that a rough outline of the Niger-Congo 
family had already emerged in the nineteenth century. 

Unfortunately, the first half of the twentieth century witnessed regres- 
sion, rather than progress, in untangling Niger-Congo relationships. In the 
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work of Meinhof, Malcolm Guthrie, and others, the affinity of Bantu and 
West African languages is discounted, with similarities being attributed to 
loans from one language to another. Bantu is regarded as a separate and 
distinct family. Although Westermann was reluctant to commit himself to 
any precise declaration about the relationship between Bantu and West Af- 
rican languages, it seems likely that he conceived the relationship as one of 
coordinate branches. When, in 1950, Greenberg proposed that Bantu was 
genetically a branch of a branch (i.e. Benue-Congo) of Niger-Congo, he was 
in fact recommending a return to views that had been prevalent a century 
earlier. Greenberg's proposals were met with hostility from some Bantuists, 
who resented having their formerly independent family accorded a tax- 
onomically less significant position in the Niger-Congo family. The evi- 
dence in favor of Greenberg's classification of Bantu was so overwhelming, 
however, that it quickly gained general acceptance. 


List 3.20. The West Sudanic Family (Westermann 1927) 


WEST SUDANIC: 
I KWA 
II BENUE-CROSS 
III TOGO-REMNANT 
IV GUR (= VOLTAIC) (including Songhai) 
V WEST ATLANTIC (excluding Fula) 
VI MANDE 


Although Westermann had failed to grasp the nature of the relationship 
between Bantu and the West African languages he called West Sudanic, he 
nonetheless had made considerable progress in delineating the subgroups 
within West Sudanic, as his 1927 classification, shown in List 3.20, demon- 
strates. Greenberg's (1963a) revision of this classification entailed relatively 
few changes: (1) Bantu was included in Benue-Cross, which was accord- 
ingly renamed Benue-Congo; (2) the Togo-Remnant group was added to 
Kwa; (3) Songhai was removed from Gur (and Niger-Kordofanian); (4) Fula 
was added to West Atlantic; and (5) Adamawa-Ubangian was proposed as 
an additional primary branch. 

In 1963 Greenberg joined the large Niger-Congo family with the small 
Kordofanian group to form Niger-Kordofanian, the possibility of which he 
had already foreseen in 1954. 


3.3.2 Present Status of Niger-Kordofanian Classification 


During the past two decades classificatory work on Niger-Kordofanian 
has concentrated almost entirely on the subgrouping of the family. The 
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only group of languages whose membership іп Niger-Kordofanian has 
been questioned is the Kadugli branch of Kordofanian. Thilo C. Schadeberg, 
who has worked extensively on Kordofanian languages, concludes “that 
it is presently not possible to assume a genetic relationship between 
Kadugli and (Niger-)Kordofanian" (Schadeberg 1981a: 304) and goes on to 
suggest that membership in Nilo-Saharan is a distinct possibility. (In per- 
sonal correspondence both Morris Goodman and Lionel Bender have 
pointed out similarities between Kadugli and languages belonging to Nilo- 
Saharan.) Greenberg (1963a: 149) already recognized that the Kadugli 
group "shows considerable divergence from the remainder" of the Kor- 
dofanian languages. 


List 3.21. The Kordofanian Family 
(Greenberg 1963a) 


KORDOFANIAN: 
I KADUGLI 
II KORDOFANIAN PROPER: 
A KATLA 
B HEIBAN 
C TALODI 
D RASHAD 


The classification adopted in this volume, shown in List 3.21, retains Ka- 
dugli as a branch of Kordofanian, but considers it coordinate with the other 
four branches. The five subfamilies that make up Kordofanian are all valid 
genetic nodes; so too is Kordofanian Proper. Whether the higher-level node 
Kordofanian is itself correct depends on whether or not Kadugli is directly 
related to the other four groups. It is now widely accepted that the Kordofa- 
nian languages are distantly related to the Niger-Congo family. 

The subgrouping of Niger-Congo, which contains roughly 1,000 lan- 
guages, is exceedingly difficult. Greenberg (19632) laid the foundation for 
future research by dividing Niger-Congo into six primary groups (see List 
3.19, above). Furthermore, he recognized that these six branches are not 
equidistant: "Kwa and Benue-Congo are particularly close to each other 
and in fact legitimate doubts arise concerning the validity of the division 
between them. On the other hand West Atlantic seems more remotely re- 
lated to the other group and Mande the most distant of all" (Greenberg 
1963a: 39). 

William Welmers had suggested as early as 1956 that the Mande family 
was the most divergent group; work by Welmers, Kay Williamson, and 
others has confirmed Greenberg’s suspicion that Kwa and Benue-Congo аге 
in fact not distinct subgroups. Greenberg (1963a: 39) also considered the 
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inclusion of Kru апа [о in Kwa to be "tentative," and several French schol- 
ars (e.g. Calame-Griaule 1978: 66) have questioned the Gur affiliation of the 
Dogon language as posited by Greenberg and others. In an attempt to re- 
fine the subgrouping of Niger-Congo, Patrick R. Bennett and Jan P. Sterk 
(1977) have proposed a reclassification of Niger-Congo that reflects the 
modifications suggested above (see List 3.22). The languages traditionally 
assigned to Kwa and Benue-Congo now constitute the node South Central 
Niger-Congo, with a single exception: the Kru group, tentatively included 
by Greenberg in Kwa, is placed in North Central Niger-Congo. 


List 3.22. The Niger-Kordofanian Family 
(Bennett & Sterk 1977) 


NIGER-KORDOFANIAN: 
I KORDOFANIAN 
II MANDE 
III NIGER-CONGO: 
А WEST ATLANTIC 
B CENTRAL NIGER-CONGO: 
1 NORTH: 
a KRU 
b GUR(= VOLTAIC) 
c ADAMAWA-UBANGIAN 
2 SOUTH: 
a WESTERN 
b JO 
c EASTERN: 
i CENTRAL NIGER 
ii YORUBOID 
ii EDO 
iv LOWER NIGER 
v JUKUNOID 
vi DELTA-CROSS 
vii EFIKOID 
viii EASTERN CROSS 
ix BENUE-ZAMBESI 


The Niger-Kordofanian classification adopted in this volume follows 
Bennett and Sterk's proposed subgrouping with a few exceptions: (1) Mande 
is considered closer to Niger-Congo than Kordofanian; (2) Dogon is treated 
as a fourth branch of North Central Niger-Congo; and (3) within South 
Central Niger-Congo, (a) the Western branch is divided into four sub- 
branches rather than two, and (b) their Bantoid subgrouping has been re- 
placed by one following Greenberg's (1974) Bane-Bantu dichotomy. 

The internal subgrouping of the various families that constitute the 
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Niger-Congo phylum presents a host of problems, only the most important 
of which can be touched upon here. Despite its genetic distance from the 
rest of Niger-Congo, the Mande group is internally quite homogeneous 
and forms a well-defined genetic node. Work by Welmers has established 
two primary branches, each consisting of two subbranches. There is, in ad- 
dition, a single language, Bobo-Fing, that appears to lie outside either pri- 
mary branch (see List 3.23). 


List 3.23. The Mande Family List 3.24. The West Atlantic Family 
(Welmers 1971) (D. Sapir 1971) 
MANDE: WEST ATLANTIC: 
I Bobo-Fing I Bijago 
II NORTHERN-WESTERN: II NORTHERN: 
A NORTHERN А SENEGAL 
B SOUTHWESTERN B CANGIN 
III SOUTHERN-EASTERN: C BAK 
A SOUTHERN D EASTERN SENEGAL- 
B EASTERN GUINEA BISSAU 


E MBULUNGISH-NALU 
III SOUTHERN: 

А Sua 

B Limba 

C MEL 


Like Mande, the West Atlantic family is subdivided into two groups and 
an isolated language, Bijago, which appears to form an independent pri- 
mary branch. Unlike Mande, however, the West Atlantic group is extremely 
heterogeneous, and in fact the validity of a West Atlantic node has been 
questioned by Dalby (1965). Bennett and Sterk (1977: 248) summarize the 
internal diversity of West Atlantic as follows: 


West Atlantic . . . is a very diverse group, containing at least three major subdivi- 
sions. It is possible that some language groups traditionally assigned to West Atlan- 
tic are in fact coordinate branches of Niger-Congo. There is no apparent common 
innovation linking West Atlantic, and evaluation of its status as a well-defined sub- 
group of Niger-Congo must await further detailed investigation. At present, all that 
can be said is that the lexicostatistical distance between branches of West Atlantic is 
nearly as great as that between West Atlantic and the remainder of Niger-Congo. 


The classification used in this volume follows the work of J. David Sapir 
(1971), as shown in List 3.24. 

The Kru family is a close-knit group. Lafage (1978) subdivides it into two 
groups and three languages isolated from the main body of Kru speakers, 
as in List 3.25. 
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List 3.25, Тһе Kru Family List 3.26. The Gur Family 
(Lafage 1978) (Manessy 1978) 
KRU: GUR: 
І Aizi I Bariba 
II Kuwaa П KULANGO-LORHON 
HI Seme Ш SENUFO-TUSYA 
IV EASTERN IV CENTRAL: 


V WESTERN A KURUMFE-OTI-VOLTA 
M 77 a B DOGHOSE-GURUNSI 


With Gur (= Voltaic) we are again dealing with a diverse family. As I 
noted above, the assignment of Dogon to Gur by Westermann, Greenberg, 
and others is still a subject of dispute. Within Gur there are two fairly well- 
defined groups (Central and Senufo), as well as a few smaller groups or 
isolated languages. I have in the main followed the subgrouping worked out 
by Gabriel Manessy (see List 3.26), especially with regard to the Central 
branch. 

The Adamawa-Ubangian branch of Niger-Congo is both diverse and 
poorly documented; the internal subgrouping is consequently less firmly 
established than for other groups. The fundamental dichotomy into two 
branches, Adamawa and Ubangian, is beyond doubt, since the gulf be- 
tween the two is deep. Within each branch the relationships are less clear; 
Adamawa shows greater internal diversity than Ubangian. For Adamawa I 
have followed Boyd (1978), for Ubangian Barreteau and Моліпо (1978) and 
Bouquiaux and Thomas (1980), as shown in List 3.27. 


List 3.27. The Adamawa-Ubangian Family (Boyd 1978, 
Barreteau & Mofiino 1978, Bouquiaux & Thomas 1980) 


ADAMAWA-UBANGIAN: 


I ADAMAWA: [1 ADAMAWA] 
A Kam K MBUM-MUNDANG 
B Longuda L YUNGUR 
C Fali M BAMBUKA 
D Nimbari N BUA 
E Kim П UBANGIAN: 
F WAJA A GBAYA-GBANZILI 
G CHAMBA B BANDA 
H DAKA C ZANDE 
I DURU D AMADI 
J MUMUYE-YENDANG E MONDUNGA 





List 3.28. Тһе South Central Niger-Congo Family 
(Bennett & Sterk 1977, Greenberg 1974, and others) 


SOUTH CENTRAL NIGER-CONGO: 
I IJO-DEFAKA 
II WESTERN: 
A VOLTA-COMOE 
B CENTRAL TOGO 
C GA-ADANGME 
D TOGO 
Ш EASTERN: 
A CENTRAL NIGER 
B YORUBA-NORTHERN AKOKO 
С EDO 
D LOWER NIGER 
E JUKUNOID 
F DELTA CROSS 
G LOWER CROSS 
H UPPER CROSS 
I BENUE-ZAMBESI: 
1 CARA 
2 МҮІМА: 
а PLATEAU 
b WEL: 
i BENDI 
ii BANTOID: 
о NON-BANTU: 
І MAMBILA-VUTE 
П TIV-BATU 
В BROAD BANTU: 
I BANE: 
A JARAWAN-EKOID 
В МАМҒЕ 
С GRASSFIELDS 
D МЕМСНОМ 
Е BEBOID 
Е ТІКАК 
П NARROW BANTU: 
A NORTHWEST 
B CENTRAL 
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Тһе remainder of the Niger-Congo family consists of those languages 
that have traditionally been assigned to either the Kwa or the Benue-Congo 
families. The difficulties inherent in subgrouping a large number (c. 750) of 
similar languages have already been underscored by the untenability of 
even the Kwa-Benue-Congo distinction. Bennett and Sterk (1977) have pro- 
posed the name South Central Niger-Congo for the group containing Kwa 
and Benue-Congo languages. Within this group it is possible to identify 
numerous low-level genetic groups (e.g. Volta-Comoe, Edo, Jukunoid, 
Plateau, Grassfields, Bantu), but establishing the correct subgrouping for 
these many groups is by its very nature an extremely delicate problem, es- 
pecially within the very closely related Bantoid group. 

For the overall structure of South Central Niger-Congo (see List 3.28), I 
have generally followed Bennett and Sterk (1977), with the exception that 
the Bantoid subgrouping is organized according to Greenberg (1974). Most 
of the subgroups within South Central Niger-Congo are based on Hoffman 
(1974), with modifications suggested by Kay Williamson. Subgroups within 
Bantoid follow a variety of sources (including Hyman 1980а, Stallcup 
1980a,b, Hombert 1980, Leroy 1980); the Bantu family is based on Bastin 
(1978). In general the South Central Niger-Congo languages become more 
and more similar as one progresses down the tree until, with the Narrow 
Bantu family, we have a group of languages so closely related that they are 
sometimes described (a bit facetiously) as 500 dialects of a single language. 
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3.4 NILO-SAHARAN 


Nilo-Saharan languages are spoken in the interior of Africa, from Mali in 
the west to Egypt, the Sudan, Ethiopia, Kenya, and Tanzania on the east 
coast. Their geographic distribution is shown in Map 3.3. They are often 
found in "islands," surrounded by either Afro-Asiatic or Niger-Congo lan- 
guages. Altogether there are about 140 languages, with a total of perhaps 
11 million speakers. Few of the languages have attracted much attention. 
Kanuri, Luo, and Nubian each have a million speakers, and Maasai and 
Songhai are somewhat well known from the ethnographic literature. 


3.4.1 History of Nilo-Saharan Classification 


The untangling of the relationships among the numerous and disparate 
languages of Central and East Africa, culminating in Greenberg’s (1963) 
Nilo-Saharan hypothesis (see List 3.9, above), proceeded at a considerably 
slower pace than was the case with the other three African language phyla. 
There are several reasons why Nilo-Saharan was the last phylum to be rec- 
ognized and why it remains the most controversial of Greenberg's four 
phyla. (1) European contact with speakers of Nilo-Saharan languages gen- 
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Map 3.3. The Nilo-Saharan family 


erally took place at a later date than with speakers of Niger-Kordofanian, 
Afro-Asiatic, and Khoisan languages primarily because these three phyla 
are (or were) spoken along the African coast; Nilo-Saharan is restricted to 
more inaccessible areas of Central Africa. (2) А consequence of this relative 
inaccessibility is that Nilo-Saharan has always been a “literature-poor” phy- 
lum (i.e. there are fewer and poorer sources of information), like Khoisan, 
in stark contrast with the "literature-rich" Afro-Asiatic and Niger-Kordofa- 
nian phyla. (3) Nilo-Saharan contains fewer languages (c. 140) than either 
Afro-Asiatic (c. 200) or Niger-Kordofanian (c. 1,000). (4) Probably the most 
critical factor, however, is that the affinity of the various constituents of 
Nilo-Saharan is simply less obvious than in the case of the other three 
phyla. This internal heterogeneity was quite apparent in Greenberg’s first 
` classification, in 1950, which posited 12 independent families rather than a 
single phylum. (5) Of these 12 independent groups, four are single lan- 
guages (Songhai, Fur, Kunama, Berta), and several others (Saharan, Maban, 
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Котап) contain only a few languages. Isolated languages, апа small groups, 
are intrinsically more difficult to classify than large groups, for the simple 
reason that they contain fewer witnesses to their historical past. 

During the nineteenth and early-twentieth centuries a number of low- 
level groups were recognized in Central and East Africa. Heinrich Barth 
noted the affinity of two Saharan languages in 1853; Lepsius identified 
Nilotic in 1880; Friedrich Müller perceived the Bongo-Bagirmi, Moru-Madi, 
and Mangbetu-Asua groups in 1889; Maurice Gaudefroy-Demombynes dis- 
covered the Maban group in 1907; and so forth. The isolated languages 
(Songhai, Fur, Berta, and Kunama) came to European attention during 
roughly the same period. 

One often finds in the history of classification that a larger family is built 
around a smaller group of languages that serves as a kind of reference 
point, or nucleus, for the expanded family. The Semitic family served this 
purpose within Afro-Asiatic (see section 3.2) and the same was also true to 
some extent for Bantu within Niger-Kordofanian (see section 3.3). The sub- 
grouping that results from combining families in this way is usually of du- 
bious value, having more to do with the historical accretion of knowledge 
than with the genetic relationships of the languages. The demise of the 
Hamitic family, discussed in section 3.2, illustrates the point. For Nilo- 
Saharan, the Nilotic languages played this pivotal role. 

In the early twentieth century two of the principal unresolved questions 
were the precise definition of the Nilotic group and the identification of 
possible relatives of the group, however defined. In 1902 Harry Johnston 
divided the Nilotic languages into two branches. Ten years later Meinhof 
asserted that these two groups, which he called Nilotic (e.g. Dinka, Nuer, 
Luo, Shilluk) and Nilo-Hamitic (e.g. Maasai, Bari, Nandi) were in fact not 
related, thus contradicting the opinion of such earlier scholars as Lepsius 
and Johnston. Pointing to the presence of gender in certain Nilo-Hamitic 
languages (e.g. Maasai), and its absence in Nilotic, Meinhof proposed that 
languages in the Nilo-Hamitic group were “mixed languages,” the prod- 
ucts of a Nilotic mother and a Hamitic father. Meinhof chose to overlook 
the numerous similarities in vocabulary between the two groups, prefer- 
ring to rely on the typological feature of gender. 

The fortunes of Nilo-Hamitic fared as poorly as those of Hamitic itself, 
and for much the same reason. Both groups were based оп typological 
traits and a fallacious theory of language mixture. When Greenberg re- 
joined the two groups in 1950, he was in fact returning to a view widely 
held a half-century earlier. Subsequently, іп 1955, Oswin Kóhler showed 
that the group previously called Nilo-Hamitic comprised instead two dis- 
tinct groups, both of which appear equally distant from the “Nilotic” 
group. He proposed calling these two branches Eastern and Southern 
Nilotic; the traditional “Nilotic” group was renamed Western Nilotic. 
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List 3.29. The Nilotic Family (Köhler 1955) 









NILOTIC: 

I WESTERN: III SOUTHERN: 
A LUO A KALENJIN 
B DINKA-NUER B TATO 

II EASTERN: 
A BARI 


B LOTUXO-TESO 


Greenberg (1963: 128) accepted this tripartite division, with the qualifica- 
tion that "if anything, Eastern Nilotic . . 15 closer to Western Nilotic . . . 
than it is to Southern Nilotic . . . and may once have formed a unity with 
it." The structure of the Nilotic family is shown in List 3.29. 

Long before the Nilo-Hamitic controversy was resolved, similarities be- 
tween Nilotic (in the broad sense) and certain other languages were recog- 
nized. In 1912 Westermann proposed a Niloto-Sudanic family linking three 
groups of what later became known as Central Sudanic with Nilotic. This 
Niloto-Sudanic family was in turn taken to be one component of a larger 
Eastern Sudanic phylum that also included Nubian, Kunama, and probably 
Berta. In 1920 G. W. Murray proposed affiliation between Nilotic and Nu- 
bian, Nera, Gaam, and Kunama. Carlo Conti Rossini (1926) also connected 
Kunama with various East Sudanic languages (in Greenberg's sense), and 
Westermann (1935) connected Murle with Nilotic. In 1940 A. N. Tucker pre- 
sented evidence connecting five branches of Greenberg's Central Sudanic 
family, though Tucker himself refrained from drawing this obvious conclu- 
sion. The Central Sudanic and Eastern Sudanic families that Greenberg 
postulated in 1950 were thus groupings that had been foreshadowed, albeit 
sometimes ambiguously, in the work of earlier scholars. 

At that time Greenberg did not attempt to link the various languages and 
language groups that would eventually constitute Nilo-Saharan, cautioning 
that "a hypothesis . . . of affiliation between Central Sudanic and Eastern 
Sudanic is as venturesome as the suggestion of a connection between Indo- 
European and Ural-Altaic." Four years later, however, he took the first step 
by proposing a Macro-Sudanic family made up of Central Sudanic, Eastern 
Sudanic, Kunama, and Berta (see List 3.8, above); in 1956 Greenberg 
changed the name Macro-Sudanic to Chari-Nile, which had been sug- 
gested by Welmers. Finally, in 1963, Greenberg offered evidence for an even 
more comprehensive family, which he named Nilo-Saharan, and of which 
Chari-Nile was one of six branches, the other five being two isolates (Song- 
hai, Fur) and three small groups (Saharan, Maban, Koman) (see List 3.30). 
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List 3.30. The Nilo-Saharan Family (Greenberg 1963) 
NILO-SAHARAN: 


I Songhai V CHARI-NILE: 
II SAHARAN А EASTERN SUDANIC 
III MABAN B CENTRAL SUDANIC 
IV Fur C Berta 
D Kunama 
VI KOMAN 





3.4.2 Present Status of Nilo-Saharan Classification 


There have been several developments in Nilo-Saharan classification 
since Greenberg's (1963, 1971) pioneering work, the most important of 
which is the demise of the Chari-Nile node. Shafer (1959), Goodman (1970), 
and Bender (1976) all disputed Greenberg/s claim that Central Sudanic, East 
Sudanic, Kunama, and Berta form a higher-level group (i.e. Chari-Nile). 
Greenberg accepts their arguments to the extent that he now considers 
Central Sudanic and East Sudanic independent branches of Nilo-Saharan, 
but he continues to believe that Kunama and Berta are genetically closest to 
the East Sudanic family. 

Another recent development, mentioned in section 3.3, has been the 
suggestion by several scholars that the Kadugli group, which Greenberg 
(1963) considered distantly related to the Kordofanian languages, may in 
fact turn out to be a member of Nilo-Saharan. 

Of all the languages Greenberg included in Nilo-Saharan, only the af- 
finity of Songhai has elicited serious reservations. As a language clearly 
distant genetically from any other language (or group of languages), Song- 
hai has had a controversial taxonomic history. Delafosse (1914) considered 
it closest to the Mande group, though most linguists of the early twentieth 
century treated it as an isolate with no known relatives. This was Green- 
berg's position in 1950, though three years later he suggested a possibility 
of affiliation with Niger-Congo. In 1963, however, he included it in the 
newly formed Nilo-Saharan phylum. Several scholars have recently revived 
Delafosse's Mande affiliation of Songhai (e.g. Mukarovsky 1966, Creissels 
1981), thus calling into question its inclusion in Nilo-Saharan. Welmers 
(1973: 15) also stresses the relative isolation of Songhai from the rest of 
Nilo-Saharan: "Within Nilo-Saharan, the most basic subdivision appears to 
be between Songhai and all the other languages." Despite these misgivings, 
Greenberg continues to regard Songhai as firmly Nilo-Saharan. 

The classification adopted in this volume follows Greenberg (1963, 1971), 
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List 3.31. The Nilo-Saharan Family (Greenberg 1971, Bender 1982) 


NILO-SAHARAN: 















I Songhai VI CENTRAL SUDANIC: 
II SAHARAN A WEST CENTRAL: 
Ш MABAN 1 BONGO-BAGIRMI 
IV FUR 2 Kresh 
V EAST SUDANIC B EAST CENTRAL: 
А EASTERN: 1 MORU-MADI 
1 NUBIAN 2 MANGBETU-ASUA 
2 SURMA 3 MANGBUTU-EFE 
3 Nera 4 Balendru 
4 EASTERN JEBEL VII Berta 
B WESTERN: VIII Kunama 
1 NYIMANG IX KOMUZ: 
2 TEMEIN А Gumuz 
3 TAMA B KOMAN 
4 DAJU 
C NILOTIC: 
1 WESTERN 
2 EASTERN 
3 SOUTHERN 
D KULIAK 


as modified by Bender (1976, 1982); see List 3.31. Bender's modifications 
include (1) the elimination of the Chari-Nile node in favor of four indepen- 
dent primary branches (i.e. Central Sudanic, East Sudanic, Kunama, and 
Berta); (2) a revision in the subgrouping of East Sudanic that unites Nubian, 
Surma, Nera, and Eastern Jebel in an Eastern branch, and Nyimang, Te- 
mein, Tama, and Daju in a Western branch; (3) moving Kibet from the Tama 
group to the Maban group; (4) grouping Biltine (one of several languages 
that have been called Mimi in earlier literature) with Fur; (5) calling the 
Koman group Komuz to reflect its bipartite nature (= Gumuz + Koman); 
and (6) renaming several branches of East Sudanic (i.e. Didinga-Murle — 
Surma, Ватеа — Nera, Tabi — Eastern Jebel, Teuso — Kuliak). The classifi- 
cation within the constituent groups of Nilo-Saharan follows the sources 
cited in the bibliography with a few exceptions. The Nubian and Daju 
groups are based on unpublished work by Robin Thelwall. It has been sug- 
gested that Central Sudanic has two primary branches, one uniting Bongo- 
Bagirmi and Kresh, the other containing the rest. This was the position of 
Delafosse (1924), followed by Tucker and Bryan (1956); Greenberg (1971: 
434) mentions such a subgrouping as a possibility, but does not commit 
himself to it. 
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Оп the basis of а recent analysis of Nilo-Saharan pronoun systems, 
Bender (1983a) suggests that the nine primary branches of Nilo-Saharan 
in List 3.31 should Бе reduced to six by joining (1) Kunama and Berta and 
(2) Maban, Fur, East Sudanic, and Central Sudanic as higher-level units 
within Nilo-Saharan. Furthermore, Bender considers East Sudanic and 
Central Sudanic the nucleus of the last mentioned group. 
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3.5 KHOISAN 


There are today about 30 Khoisan languages, with a total of 120,000 speak- 
ers. It is believed that Khoisan languages were originally spoken through- 
out most of southern Africa. But the Bantu expansion from the north and 
the Dutch invasion from the south (both by agricultural societies that could 
support large populations, and with advanced weapons to boot) over- 
whelmed the Khoisan peoples, who today eke out their living surrounded 
by Bantu and/or Germanic languages. As seen in Map 3.4, most Khoisan 
languages are found in South Africa, Namibia, Botswana, and Angola. Two 
distantly related languages, however, are spoken in northern Tanzania. 
Within the Khoisan family only Nama (- Hottentot) is at all well known. 
Khoisan languages are renowned for their use of "clicks" as ordinary 
consonants. 
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Map 3.4. The Khoisan family 
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3.5.1 History of Khoisan Classification 


As we saw in section 3.1, the distinctness of the Nama and Bushman 
languages from the surrounding Bantu group was recognized early in the 
nineteenth century by Lichtenstein and others, no doubt in large measure 
because of the distinctive clicks and the physical appearance (Bushmanoid) 
of the speakers. In 1847 John W. Appleyard, in a series of articles on the 
languages of Southern Africa, asserted that “the dialects of the Bushmen 
are very numerous, though all furnish sufficient evidence оҒа common ori- 
gin with those of the Hottentot family” (guoted in Cole 1971: 15). The fact 
that the two groups practice a different mode of existence (cattle herding for 
the Nama, hunting and gathering for the Bushmen) led to their initial sepa- 
ration into two groups, a distinction that was eventually incorporated into 
linguistic classifications, where it had no relevance. 

The first classification of Khoisan languages is usually attributed to 
Wilhelm Bleek (1858), who divided the languages into gender and gen- 
derless groups, a dichotomy that happened to correspond to the Nama 
(gender)-Bushman (genderless) distinction. Or so Bleek thought, for it 
turns out that some Bushman languages do in fact have gender. 

Bleek's work on Khoisan languages was continued by his sister-in-law, 
Lucy Lloyd, and his daughter, Dorothea Bleek. Four years after Wilhelm 
Bleek's death in 1875, Lucy Lloyd began the study of Охй (= !Kung), the 
first such investigation of a language from what has become known as the 
Northern group. The first grammatical description (H. Vedder's grammar of 
Qxû) of a Bushman language did not appear, however, until 1910. 

The 19205 witnessed a flurry of activity on Khoisan languages that laid 
the foundation for all further research. In the early 1920's Albert Drexel pro- 
posed a classification incorporating two click languages of East Africa (San- 
dawe and Hadza) with the Nama and Bushman groups. А major advance 
occurred in 1927, when Dorothea Bleek divided the Bushman languages 
into three groups: Northern, Central, and Southern; two years later she 
showed that Nama, rather than constituting an independent branch of the 
family, belongs instead to the Central group. It was also in the late 1920's 
that the term Khoisan was coined to designate the family as a whole. This 
name combines the Nama word for person (Khoi or Kxoe) with the Nama 
word for the Bushmen (San); it was first proposed in 1928 by the anthropol- 
ogist L. Schultze as a designation for the Bushmanoid racial type. Two 
years later I. Schapera adopted Khoisan as a name for the linguistic family. 

Alongside the developments sketched above, some scholars took a differ- 
ent taxonomic direction, attempting to link Nama with the Hamitic family 
(see section 3.1). Lepsius first proposed this affiliation in 1863, and it was 
incorporated in Meinhof's expanded Hamitic family in the early twentieth 
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century. With the demise of the original Hamitic family (and its extension 
to Nilo-Hamitic as well as Nama), occasioned by Greenberg's midcentury 
classification, the supposed Nama-Hamitic link fell into disfavor and is not, 
to my knowledge, presently adhered to by anyone, unlike Nilo-Hamitic, 
which continues to have a few diehard supporters. 

Greenberg's 1950 classification followed the work of Dorothea Bleek in 
dividing the Khoisan languages into three groups, with Nama included in 
the Central branch. The two East African click languages, Sandawe and 
Hadza, were treated as distantly related to the other Khoisan languages. 
Thus for Greenberg the Khoisan phylum consists of three primary branches, 
two of which are single languages. The third branch contains three sub- 
groups (see List 3.32). 


List 3.32. The Khoisan Family 
(Greenberg 1949—50/1954, 1963) 


KHOISAN: 
I Hadza 
II Sandawe 
HI SOUTHERN AFRICA: 
A NORTHERN 
B CENTRAL 
C SOUTHERN 


3.5.2 Present Status of Khoisan Classification 


The most controversial aspect of Khoisan classification has been the affil- 
iation of the two isolates, Hadza and Sandawe, with the rest of the phylum. 
Although both languages are spoken in relative proximity in Tanzania, 
some 3,000 miles (4,800 km) from the Southern Africa group, they exhibit 
no more similarity to each other than they do to the Khoisan languages 
spoken in the south. The evidence (see Greenberg 1963) that links these ge- 
ographically isolated languages with the bulk of Khoisan tongues farther 
south is not of the overwhelming variety that makes the Bantu, Semitic, or 
Mande families apparent from even a cursory examination of the evidence. 
It is, rather, of a subtle variety that can be discovered only by the patient 
accumulation of a substantial body of data, from which a small but signifi- 
cant number of similarities may be culled. It is the kind of evidence that 
characterizes much of Greenberg's work, in Africa as well as in Oceania 
(see section 5.2) and the New World (see Chapter 6). 

The South African scholar E. O. J. Westphal, a specialist in Khoisan lan- 
guages, disputes the affiliation of Hadza and Sandawe with the other 
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Khoisan languages on essentially two grounds: (1) these two isolates do not 
exhibit regular sound correspondences with the rest of the Khoisan group; and 
(2) if the two isolates (which are found almost side by side some 3,000 miles 
from the other languages) cannot be shown to be related to each other 
within their own specific East African context, then it is impossible to prove 
their genetic affiliation by comparing them with a family of languages 
spoken thousands of miles away. 

Westphal's first objection is a general criticism of Greenberg's work that 
will be dealt with in detail in the following section (3.6). The second objec- 
tion also merits examination, since it involves a fundamental principle of 
genetic classification that is often not fully appreciated. 

Westphal (1971: 382) states categorically that "if Hatsa and Sandawe be- 
long to the same family then this should be apparent from a direct com- 
parison of these two languages." One of the great merits of Greenberg's 
methodology of “mass comparison” is that it is able to overcome obstacles 
that appear insurmountable to the “binarist” (i.e. а linguist who tackles ge- 
netic classification by asking the question, Is language X related to language 
Y?). Greenberg's method is in fact so powerful that it is possible, under cer- 
tain circumstances, to prove that languages are genetically related even if 
they share not a single cognate. 

Let us assume for the moment that Hadza and Sandawe each share 20 
plausible cognates with the Southern Africa Khoisan group, but that they 
share none with each other. Since the links between Hadza and the south- 
ern languages, and between Sandawe and the southern languages, are 
fairly strong, it follows from the principle of transitivity that Hadza and 
Sandawe must themselves be related, since they are demonstrably related 
to the same family, i.e. Southern African Khoisan. But this is obviously a 
conclusion that cannot be obtained if one arbitrarily limits one's perspective 
to the two isolates, between which we have assumed there are no simi- 
larities at all. It is only in the richer context of mass comparison that the 
lines of genetic affiliation can emerge. In fact, the evidence Greenberg 
offered to connect Hadza and Sandawe is considerably stronger than our 
hypothetical figures. Of the 116 etymologies he adduced in support of the 
Khoisan phylum as a whole, Hadza participates in 75, and Sandawe figures 
in 52. Hadza and Sandawe occur together, however, in only 11 of the 
etymologies. 

The inadequacy of the binary approach to language classification has 
been recognized since the nineteenth century. Furthermore, during the de- 
velopment of Indo-European comparative linguistics itself, scholars did not 
attempt to prove the affinity of, say, Albanian and Armenian by a direct 
confrontation of these two languages alone. Rather the proof was invariably 
made in terms of the higher-level unit (Indo-European) to which both be- 
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longed. Тһе same is true of biological classification, іп which, as Greenberg 
(1979: 20) observes: “It has occurred to no biologist . . . to write a special 
treatise to prove that sparrows are genetically related to whales by a direct 
and exclusive comparison between the two. Yet no biologist doubts the rela- 
tionship, since they are both vertebrates." While such ad hoc binary com- 
parisons would be considered a joke in biological taxonomy, they are still 
widely practiced by certain segments of the linguistic community, to the 
dismay and chagrin of more seasoned taxonomists (see section 6.3). 

The Southern African subgroup of Khoisan consists of three distinct 
branches: Northern, Central, and Southern. The first two are internally 
fairly homogeneous; the Southern branch is composed of two distinct fami- 
lies (Таға and !Wi). Kohler (1974: 230) suggests that the Central and North- 
ern groups are genetically closer to each other than either is to the South- 
ern group. Kóhler regards Sandawe as significantly closer to South African 
Khoisan than is Hadza, an opinion shared by Elderkin (1982), who also 
points out resemblances between Hadza and Omotic, Chadic, and Kuliak. 
Westphal (1971), on the other hand, considers the three South African 
groups to be independent families not relatable either to each other or to 
Hadza and Sandawe, on the basis of present evidence. 
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3.6 CRITICISM OF GREENBERG’S METHODOLOGY 


In the 20 years that have passed since the appearance of Greenberg’s 
pivotal 1963 classification of African languages, there has developed in the 
scholarly community a curious ambivalence toward his work. Greenberg’s 
results are now generally accepted by the vast majority of African spe- 
cialists and have constituted the basic framework for research on African 
classification over the past two decades. Furthermore, it is widely acknowl- 
edged that serious work on African classification had been severely im- 
peded by a number of “pseudo-issues,” of which Meinhof's extended 
Hamitic family is probably the most notorious. In addition to providing a 
complete classification of African languages, Greenberg's work was instru- 
mental in identifying and eliminating these spurious issues from the schol- 
arly debate. 

But if Greenberg's results have won general acceptance, his methods, curi- 
ously enough, have not. Indeed, his methodology of "mass comparison," 
which he now prefers to call "multilateral comparison," remains almost as 
controversial today as it was іп the 19505. It is of interest to examine how 
and why this unusual situation has developed, for it illuminates the single 
chief impediment to continued progress in genetic classification, not only 
in Africa, but in the rest of the world as well. 

As Greenberg has stressed since his earliest work, the technique of mul- 
tilateral comparison is the oldest, quickest, and most effective method of 
untangling genetic relationships over a wide area. It consists essentially of 
collecting wordlists, usually restricted to several hundred basic vocabulary 
items, from as wide a range of languages as possible. The vocabulary items 
may be either lexical (e.g. ‘hand,’ ‘sky,’ ‘red,’ “ро” or grammatical (e.g. PLU- 
RAL, PAST TENSE, MASCULINE). Once a substantial number of wordlists has 
been assembled, their comparison inevitably leads to the grouping of lan- 
guages into a certain number of obviously valid genetic units (e.g. Ro- 
mance, Germanic, Semitic, Austronesian, Athabaskan, Algonquian), with 
perhaps a residue of languages not readily classifiable either because they 
are remote from any of the established groups or because there is insuffi- 
cient material available to determine their affiliation. 
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Once the low-level groups have been identified on the basis of numerous 
shared cognates, the next step is the comparison of these groups among 
themselves. Although the similarities will not be as startling as they are at 
the lowest level of classification (see Table 1.2 in section 1.4), significant re- 
semblances among certain of the groups will usually be apparent (see Table 
1.1 in section 1.3). Genetic classification can be continued as long as there 
are groups to compare, but historically there has been a bias against carry- 
ing it beyond fairly obvious groups (e.g. Indo-European, Uralic, Bantu, 
Austronesian, Salish) for reasons that will be discussed both in this section 
and in section 7.3. 

The principal objection to Greenberg's methodology is that, although he 
has discovered substantial numbers of apparent cognates for the groups he 
postulates, these sets of cognates do not exhibit the regular sound correspon- 
dences (see section 2.3) that many linguists have come to regard as "the only 
real proof of genetic relationship" (Welmers 1973: 5). For István Fodor 
(1969), Greenberg's “entire method is vitiated, in fact, annihilated,” by the 
failure to demonstrate "the existence of phonetic laws, which is the stan- 
dard of all comparisons of genetic purpose" (p. 24). According to Fodor, 
“the proof of the affinity of languages is regular correspondence of their 
sounds" (p. 86) and "without sound laws the genetic relations of the Af- 
rican languages cannot be proved" (p. 135). Furthermore, this point of view 
is by no means restricted to Africanists, as the following citations indicate: 


Without sound correspondences among the languages being considered, etymology 
is nothing more than a game and does not constitute proof (Meillet 1937: 462). 


Sauvageot . . . has pointed out as the main weakness of most of the previous works 
in [Ural-Altaic comparative studies] the indiscriminate comparison of words which 
somehow resembled each other. Sauvageot states that only such comparisons count 
which permit of establishing regular sound correspondences (Poppe 1965: 128). 


The interrelationships of North and South Caucasian are as yet uncertain because of 
the absence of any regular sound correspondences between them (Gamkrelidze 
1974: 1011). 


In order to prove that a group of languages is genetically related, it is necessary... 
to detail the regular changes by which each modern language has developed from 
the proto-system. Simply remarking on recurrent similarities and counting possible 
cognates may suggest a genetic relationship, but it does not constitute proof (Dixon 
1980: 221). 


The fundamental criterion for cognation is systematic sound correspondences in 
basic vocabulary (McAlpin 1981: 128). 


More characteristic of prevailing views, at least with respect to African 
classification, is the following measured opinion of Welmers (1973: 5—6): 
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Greenberg has поі, to Бе sure, demonstrated the existence of regular phonetic cor- 
respondences among all the languages in any of the four language families һе posits 
for Africa, though it has already been implied that such correspondences are the 
only real proof of genetic relationship. However, . . . evidence that falls short of 
clear demonstration of regular phonetic correspondences may nevertheless be over- 
whelming. . . . The nature of the similar forms with similar meanings which Green- 
berg cites, and the number of them, is such that the fact of genetic relationship can 
be considered established. 


While accepting Greenberg's results in full, Gregersen (1977: 122) too is 
troubled by the absence of sound correspondences in Greenberg's pro- 
posed etymologies, a defect that he believes "raises the important issue of 
eventually rigorizing the evidence for Greenberg's conclusions" (emphasis 
added). 

The notion that the ultimate proof of genetic affinity is the presence of 
regular sound correspondences represents a peculiar twentieth-century 
distortion of nineteenth-century comparative linguistics, as well as a mis- 
understanding of the nature of scientific proof. In the first place, many 
linguists presently have the mistaken impression that the Indo-European 
family was somehow "proved" by the discovery of regular sound corre- 
spondences: "In the methodology worked out by Indo-Europeanists, only 
the establishment of sound correspondences was taken as proof of genetic 
relationship" (Gregersen 1977: 103). 

Nothing could be further from the truth, for the Indo-European family 
was well established, even with regard to its constituent branches, long be- 
fore sound laws and sound correspondences came into vogue (see section 
2.3). The emphasis that the Neo-Grammarians placed on regular sound 
laws had nothing whatever to do with validating the Indo-European family, 
the existence of which had not been in doubt since the early nineteenth 
century. In fact, as Greenberg, Paul Newman, Ives Goddard, and others 
have stressed, it is the prior recognition of valid genetic groups that leads to 
the discovery of sound correspondences, rather than the other way around: 
"The proof of genetic relationship does not depend on the demonstration 
of historical sound laws. Rather, the discovery of sound laws and the recon- 
struction of linguistic history normally emerge from the careful comparison 
of languages already presumed to be related" (Newman 1970: 39). 

Furthermore, many linguists today are laboring under the delusion that 
regular sound correspondences, which stand out so sharply when one ex- 
amines low-level genetic groups like Bantu, Romance, and Semitic, will 
one day be discovered to the same degree in more ancient groupings such as 
Niger-Congo and Nilo-Saharan. This is essentially what Welmers (1963: 
416) means when he writes that "for all of Greenberg's four families, genetic 
relationship has not yet been demonstrated in the same way as it has for the 
homogeneous Bantu group itself, or for Indo-European." However, as 
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Greenberg (1969: 429) makes clear, there is a whole host of perturbative in- 
fluences on regular sound change (and hence on regular sound correspon- 
dences), accepted by even the strictest Neo-Grammarian, “which must 
produce greater and greater variety of correspondences as time goes on.” 
Like any part of linguistic structure, correspondences are subject to the 
same kinds of change that continually affect all languages everywhere. As 
time goes by, these correspondences are gradually, but continually, ob- 
scured until eventually they become indiscernible. What Greenberg's work 
on genetic classification demonstrates guite clearly is that even after corre- 
spondences have been eroded to the point of invisibility, it is still often pos- 
sible to identify sufficient cognates so that the genetic unity of a group is 
not in doubt. 

Three final points must be touched upon briefly. First, sound correspon- 
dences even at the lowest levels of classification (e.g. Bantu, Romance) are 
not exceptionless laws, as Fodor (1969) somewhat naively believes; and in 
traditional comparative linguistics etymologies are not usually dismissed 
for failing to obey a phonetic law, provided they appear otherwise sound. 
Second, sound correspondences do not discriminate between loanwords 
and true cognates, as is sometimes alleged, with only true cognates obey- 
ing sound laws. In this regard one should not forget the numerous sound 
laws established by Wulff (1934) to link the Tai languages with Sino-Tibetan, 
a relationship now almost universally abandoned, since the Tai "cognates" 
are currently considered to be early loanwords from (and into) Chinese. Fi- 
nally, it is worth remarking that there is no analog to sound correspon- 
dences in biological classification, which is nonetheless recognized as being 
in a more advanced state than linguistic classification. 

If, as has been argued above, regular sound correspondences are neither 
necessary nor sufficient for demonstrating genetic relationship, how are 
we to explain the almost blind faith many linguists place in them as the “ul- 
timate proof" of genetic affinity? The answer lies in a widespread misun- 
derstanding of the nature of scientific proof. Consider what the biologist 
Ernst Mayr (1982: 25— 26) has to say about this notion: 


Descartes endeavored to present only such conclusions and theories as had the cer- 
tainty of a mathematical proof. Although there have always been some dissenters, 
the belief that a scientist had to supply absolute proof for all his findings and theo- 
ries prevailed until modern times. It dominated not only the physical sciences, 
where proof of the nature of a mathematical proof is often possible, but also the 
biological sciences. Even here, inferences are often so conclusive that they can be 
accepted as proof. . . . In many cases, however, and perhaps in the majority of the 
conclusions of the biologists, it is impossible to supply proof of such certainty. . . . 
Eventually the physicists also realized that they could not always give absolute 
proof, . . . and the new theory of science no longer demands it. Instead, scientists 
are satisfied to consider as true either that which appears most probable on the 
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basis of available evidence, ог that which is consistent with more, ог more compel- 
ling, facts than competing hypotheses. 


In this context it becomes clear that the use of sound correspondences to 
“prove” genetic affiliation represents a misguided attempt to provide a 
mathematical proof for a question that is not susceptible to such a proof. 
There is no litmus test for the detection of genetic relationships, as the un- 
successful attempts to use inflectional similarities (in the nineteenth cen- 
tury) and sound correspondences (in the twentieth century) for this pur- 
pose attest. Rather we must be satisfied with “inferences . . . so conclusive 
that they can be accepted as proof." The etymologies that Greenberg offers 
for his African families are of precisely this nature. 
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Asia 


Asia differs dramatically from Africa in the number, diversity, and distri- 
bution of the languages. Whereas in Africa we find many families, each 
containing numerous languages, intertwined in such a way that the genetic 
picture remained muddled into the twentieth century, in Asia the families 
are considerably fewer in number. They usually contain relatively few lan- 
guages, and, practically without erception, they are sharply delineated 
from neighboring families. For all of these reasons most Asian families 
were recognized fairly early in the development of genetic linguistics. For 
northern Asia Strahlenberg had already outlined the major families by 
1730, and in South Asia the picture was clarified during the nineteenth cen- 
tury, at least with regard to the major genetic groups. 

Before we examine these various groups in greater detail, a brief survey 
of Asian families will help to orient the reader. Beginning with the north- 
west corner of Asia, and proceeding roughly counterclockwise, the lin- 
guistic picture presents itself as follows. In the northwest we find the 
Samoyed family, a branch of Uralic (see section 2.4). Several branches of the 
Indo-Hittite family (see section 2.3) are (or were) found in Southwest Asia 
(Iranian, Armenian, Anatolian) or South Asia (Indic, Tocharian). Тһе Altaic 
family, which extends from Turkey across Central Asia into Siberia, is usu- 
ally taken to include the Turkic, Mongolian, and Tungus families, and 
sometimes Korean, Japanese, апа Ainu as well. Тһе Dravidian family is 
found in southern India, with a few scattered outliers farther north. The 
Austroasiatic family comprises two branches, the small Munda family in 
northeast India and the larger, and more diverse, Mon-Khmer family, 
found in Southeast Asia and on the Nicobar Islands in the Bay of Bengal. 
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Languages belonging to the Sino-Tibetan family аге found over а wide area 
of East Asia from Tibet and Nepal throughout China and extending south- 
ward into India, Burma, and Laos. 

Three additional small families are found іп Southeast Asia. Several 
dozen Daic languages cover a broad belt from southern China through Laos 
and Thailand as far south as the Malay Peninsula. The Miao-Yao family con- 
tains only a handful of languages, spoken in numerous scattered enclaves 
in southern China, Vietnam, Laos, and Thailand. The small Chamic group, 
found in southern Vietnam, in Kampuchea, and on the Chinese island of 
Hainan, belongs to the large Austronesian family, which covers most of 
Oceania and will be discussed in section 5.1. 

In North and Northeast Asia we find only a few small families and sev- 
eral language isolates. The Chukchi-Kamchatkan family is spoken on the 
Chukchi and Kamchatkan peninsulas. West of the Chukchi-Kamchatkan 
family, the Yukaghir language is the sole survivor of a previously more nu- 
merous group whose closest relative appears to be the Uralic family (see 
section 2.4). Farther west still, in North Asia, Ket is the sole surviving 
member of the Yeniseian family. Its genetic affiliation is unknown, with the 
most common proposal (Sino-Tibetan) enjoying little support. Several vari- 
eties of Eskimo are spoken in northeastern Siberia, but most Eskimo dia- 
lects are found along the northern perimeter of North America, and they 
will be discussed in that context (see section 6.1). 

Finally, there are three language isolates in Asia. Gilyak is spoken on the 
island of Sakhalin, just off the east coast of Asia. It is most often linked with 
the Chukchi-Kamchatkan and Eskimo-Aleut families farther to the north. 
In this regard Roman Jakobson (1942: 604) suggested a number of plausible 
cognates between Gilyak and Aleut. The other two isolates are found in In- 
dia. Spoken by several hundred people in Northeast India, Nehali is some- 
times grouped with Austroasiatic. Burushaski is spoken by some 50,000 
people in a mountainous region of northern India; its genetic affiliation re- 
mains a complete mystery. 

Though there have been many attempts to link one Asian family with 
another, few have garnered much support. Two such proposals are the 
Austric phylum (originally conceived as including Austronesian and Aus- 
troasiatic) and the Austro-Tai stock (which originally linked Austronesian 
with the Daic family). Both proposals are discussed in section 4.5. 
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4.1 ALTAIC 


The Altaic family is composed of three subfamilies (Turkic, Mongolian, 
Tungus) and three isolated languages (Korean, Japanese, Ainu), as seen in 
Map 4.1. Altogether there are approximately 60 languages and 250 million 
speakers. Some 30 Turkic languages are spoken by 80 million people across 
a wide belt extending from Turkey eastward through the U.S.S.R. into 
northwestern China. With 40 million speakers, Turkish is the best known 
Turkic language, but Uzbek, Azerbaijani, Tatar, Uighur, Kazakh, Turkmen, 
Chuvash, Kirghiz, and Bashkir all have more than a million speakers (and 
in some cases considerably more). A dozen Mongolian languages, with 
three million speakers, are found in Mongolia and surrounding portions of 
the U.S.S.R. and China. One Mongolian language (Moghol) is spoken in 
Afghanistan. Some 16 Tungus languages, with 80,000 speakers, are found 
in scattered "islands" throughout much of Asiatic U.S.S.R. and parts of 
northern China. The best-known Tungus language is Manchu, spoken in 
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Map 4.1. The Altaic family 
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northeastern China. Estimates at the number of Manchu speakers range 
from almost three million to zero (the explanation for this discrepancy is 
that, although there are several million ethnic Manchu in China, the Man- 
chu language is either extinct or on the verge of extinction). Korean (55 mil- 
lion speakers) is spoken primarily in Korea, and Japanese (115 million) pri- 
marily in Japan. Ainu is spoken in northern Japan and on the southern 
portion of Sakhalin Island (U.S.S.R.) to the north. Estimates of 15,000 Ainu 
in Japan and 1,500 in the U.S.S.R. apparently refer to ethnic Ainu; the Ainu 
language is close to extinction. 


4.1.1 History of Altaic Classification 


The study of the Altaic family has had a long and stormy history, and 
even today there is considerable disagreement among specialists over ex- 
actly which languages belong to the family. As I mentioned in section 3.4, 
one often finds in the history of genetic classification that some close-knit 
family serves as a nucleus around which increasingly larger families are 
built. With Altaic we find an unusual development in the opposite direc- 
tion: the original conception of Altaic in the nineteenth century was consid- 
erably broader than it is today. 

The first grammars of individual Altaic languages appeared during the 
seventeenth century; the first classification of languages now considered 
Altaic is found in Strahlenberg (1730), where a Tatar family is posited, as 
shown in List 4.1. Nineteenth-century scholars proposed revisions of this 
family that either expanded its scope (e.g. Rasmus Rask, Max Müller) or 
narrowed it (e.g. M. A. Castrén, W. Schott). In 1834 Rask added Eskimo, 
Chukchi-Kamchatkan, and Basque to the family and renamed it Scythian; 
he also correctly grouped Manchu with Tungus, not Mongolian, a point on 
which Strahlenberg had been in error (see List 4.2). 

Max Müller enlarged Rask's Scythian family in 1855 by including Thai, 


List 4.1. The Tatar Family List 4.2. The Scythian Family 
(Strahlenberg 1730) (Rask 1834) 
TATAR: SCYTHIAN: 
I FINNO-UGRIC I MONGOLIAN 
II TURKIC II MANCHU-TUNGUS 
Ш SAMOYED III TURKIC 
IV MONGOL-MANCHU IV URALIC 
V TUNGUS V ESKIMO 


VI CAUCASIAN VI CHUKCHI-KAMCHATKAN 
точ ле 71252022: 2 VII CAUCASIAN 
ҮШ Basgue 
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List 4.3. The Altaic Family List 4.4. The Ural-Altaic Family 
(Schott 1849) (20th century) 
ALTAIC: URAL-ALTAIC: 
I CHUDIC: I URALIC: 
А FINNO-UGRIC А FINNO-UGRIC 
B SAMOYED B SAMOYED 
II TATAR: II ALTAIC: 
A TURKIC A TURKIC 
B MONGOLIAN B MONGOLIAN 


C TUNGUS C TUNGUS 


Tibetan, Dravidian, and Malay; he renamed the family Turanian. Rask's and 
Müller's proposals turned out to be deadends; it was the work of Castrén 
and Schott, both of whom pruned Caucasian from Strahlenberg's Tatar fam- 
ily, that led to contemporary views. Both Castrén and Schott rejected the 
facile typological approach and insisted on lexical and morphological simi- 
larities; that is, they insisted on the same standard of proof that had been 
required of Indo-European. Both men reached their conclusions around the 
middle of the nineteenth century. In 1849 Schott proposed an Altaic (or 
Chudic-Tatar) family composed of a Chudic (or Finnic) branch and a Tatar 
branch, as in List 4.3. 

Castrén’s views, which were not published until 1862, 10 years after his 
death, were very similar to Schott's, with the exception that he believed 
Turkic to be closer to Chudic than to Mongolian or Tungus. On this point 
modern scholarship has sided with Schott. Schott's classification is in fact 
quite modern in conception, though his terminology is no longer used. 
What he called Altaic is now known as Ural-Altaic; Chudic has been re- 
placed by Uralic (see section 2.4); and Tatar corresponds to the modern 
definition of Altaic (see List 4.4). 

During the century following Schott's and Castrén's work, the Ural-Altaic 
family (to use the modern terminology) developed in two directions. First, 
the validity of the genetic link between Uralic and Altaic was increasingly 
questioned, and the most widespread opinion today rejects such a link. 
Nevertheless, there has always been a small group of scholars who have not 
only steadfastly defended the Ural-Altaic family, but even expanded it 
to include Chukchi-Kamchatkan, Eskimo-Aleut, Indo-European (and, for 
some, Dravidian, South Caucasian, and Semitic). In addition, Korean, Japa- 
nese, Ainu, and Gilyak are frequently included in the expanded family, 
which is known most commonly as Nostratic or Eurasiatic. The question of 
possible genetic links between well-established families (of which the Ural- 
Altaic hypothesis is one example) is discussed in section 7.3. 
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The second line of development has been the expansion of Altaic Proper 
(= Turkic, Mongolian, and Tungus) to include Korean, Japanese, and Ainu. 
Korean was first connected with Altaic in 1823 by H. J. Klaproth. The Aus- 
trian scholar Anton Boller sought to show in 1857 that both Korean and 
Japanese were related to the Ural-Altaic family. In the late 1920's the Russian 
linguist E. D. Polivanov and the Finnish linguist G. J. Ramstedt (usually 
regarded as the founder of comparative Altaic linguistics) both offered 
evidence linking Korean to Altaic Proper. In 1966 the American linguist 
Samuel E. Martin proposed more than 300 etymologies connecting Korean 
with Japanese (an idea first suggested in 1717), and in 1971 another Ameri- 
can linguist, Roy A. Miller, sought to demonstrate the affinity of Japanese 
with both Korean and Altaic Proper. 

Ainu has been associated with Altaic less often than Korean or Japanese, 
though Street (1962) suggested such a relationship in the context of a larger 
North Asiatic family. Recently Patrie (1982) has adduced considerable evi- 
dence linking Ainu with the rest of Altaic, including both Japanese and 
Korean. 


4.1.2 Present Status of Altaic Classification 


There is no consensus today on either the membership or the subgroup- 
ing of the Altaic family. At one extreme are those linguists who reject even 
Altaic Proper as a valid genetic group (e.g. Clauson 1956, Doerfer 1963, 
Comrie 1981); at the other extreme are those who believe Altaic Proper is 
genetically related both to languages farther west (Uralic, Indo-European) 
and farther east (Korean, Japanese, Ainu, Gilyak, Chukchi-Kamchatkan, 
Eskimo-Aleut) in a family called Eurasiatic (e.g. Greenberg 1979, 1986). Be- 
tween these two extremes lie a number of intermediate proposals, some of 
which are illustrated in List 4.5. 

In weighing the scholarly support for these various configurations, I con- 
clude that Altaic Proper (= Turkic, Mongolian, Tungus) is now widely ac- 
cepted as a valid genetic group. The affiliation of Korean with this complex 
has also gained considerable support, and Japanese appears to be gaining 
in popularity as a member of Altaic, though still in a minority position. 
Ainu has been mentioned less often in connection with Altaic, but Patrie 
(1982) changes this state of affairs significantly. Looking westward, the de- 
fenders of the Uralic connection are in a decreasing minority. Those who 
include even more distant groups such as Indo-European, Chukchi-Kam- 
chatkan, or Eskimo-Aleut are seldom accorded much attention by Altaicists. 

The present unresolved state of Altaic classification has several causes. 
First of all, except in the case of Altaic Proper, the genetic affinity of the 
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List 4.5. Proposed Subgroupings of the Altaic Family 


Schott 1849 Street 1962 
ALTAIC: NORTH ASIATIC: 
І TURKIC I KOREAN-JAPANESE-AINU: 
II MONGOLIAN A Ainu 
Ш TUNGUS B KOREAN-JAPANESE: 
1 Korean 
Ramstedt 1952-57 2 Japanese 
ALTAIC: H ALTAIC: 
1 TURKIC A EAST: 
1 TUNGUS 
П MONGOLIAN 
III TUNGUS 2 MONGOLIAN 
IV Korean B WEST: 
1 Chuvash 
2 TURKIC 
Poppe 1965 Miller 1971 
ALTAIC: ALTAIC: 
I Korean I WESTERN: 
II ALTAIC PROPER: A Chuvash 
A TURKIC: B TURKIC 
1 Chuvash П EASTERN: 
2 ТОККІС PROPER А MONGOLIAN 
В MONGOLIAN-TUNGUS: В TUNGUS-KOREAN-JAPANESE: 
1 MONGOLIAN 1 TUNGUS 
2 TUNGUS 2 KOREAN-JAPANESE: 
a Korean 
b Japanese 


various groups and languages is not obvious. Second, because of the dis- 
tant relationships involved, as well as the broad geographic distribution of 
the languages in question, Altaic classification has all too often been inves- 
tigated with a "binary approach" (see section 3.5), where the focus of the 
investigation has been artificially restricted to those languages in which a 
particular scholar happens to have expertise. Although binary comparisons 
may suggest a genetic connection between two languages (or groups of lan- 
guages), they do not tell us the nature of that link, thus producing a mud- 
dled picture with respect to subgrouping. What Altaicists frequently over- 
look is that even if Korean (or Japanese) is related to Altaic Proper, this does 
not necessarily mean that Korean and Altaic Proper constitute a valid ge- 
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netic group. English, French, апа Spanish аге all related, but they form по 
valid group. As will be made clear in section 7.3, the failure to study the 
Altaic problem іп а sufficiently broad context is primarily responsible for 
the present unsettled state of affairs. Third, Altaic comparative linguistics 
has been plagued by the fallacy of negative evidence. Some critics of Altaic 
Proper, for example, point out that whereas Turkic and Mongolian have 
very similar color words, the corresponding terms im Tungus are totally dif- 
ferent. Therefore, they claim, the Altaic affinity of Tungus is called into 
question. Though arguments of this nature are surprisingly common (and 
not just in Altaic studies), they are of no importance, because no one either 
pretends or demands that related languages be similar in every detail. Lex- 
ical innovation, through either borrowing or invention, is a fact of life that 
in no way undermines genetic classification. Finally, the fact that most 
members, or putative members, of Altaic are either language isolates (e.g. 
Korean, Ainu) or small homogeneous groups (e.g. Turkic, Mongolian, Japa- 
nese) renders the classificatory problems more difficult than with large di- 
verse groups. 

The Altaic classification adopted in this volume is similar in conception 
to Street (1962) and Patrie (1982), though with a slightly different termi- 
nology (see List 4.6). The validity of defining Altaic in this manner will be 
examined again in section 7.3 in the context of an even broader Eurasiatic 
family. 


List 4.6. The Altaic Family List 4.7. Тһе Turkic Family 
(Street 1962, Patrie 1982) (Voegelin & Voegelin 1977) 
ALTAIC: TURKIC: 
I ALTAICPROPER: I BOLGAR: Chuvash 
A TURKIC П СОММОМ TURKIC: 
В MONGOLIAN-TUNGUS: А SOUTHERN 
1 MONGOLIAN В EASTERN 
2 TUNGUS С WESTERN 
П KOREAN-JAPANESE: D CENTRAL 
A Korean E NORTHERN 
B Ainu 
C JAPANESE-RYUKYUAN: 
1 Japanese 
2 Ryukyuan 


Let us turn now to subgrouping within the various constituents of Al- 
taic. With respect to Altaic Proper there have been two principal views, 
turning on whether Mongolian is closer to Turkic (e.g. Vladimirtsov 1929) 
or to Tungus (e.g. Street 1962, Poppe 1965, Miller 1971). 


4.1 Altaic 133 


Of the three families constituting Altaic Proper, Turkic and Mongolian 
are both extremely homogeneous, and Tungus is apparently only slightly 
less so. According to Poppe (1965: 107), contemporary views on Turkic de- 
rive from the classification of Alexandr Samojlovich (1922), "which, with 
some minor changes, is the basis for present-day classification." Within 
Turkic a single language, Chuvash (the sole surviving member of the Bolgar 
group), stands at some distance from the rest of the family. Of the five geo- 
graphic groups making up Common Turkic, the Northern is the most diver- 
gent. The classification followed here (List 4.7) is based on Voegelin and 
Voegelin (1977), with certain modifications suggested by Robert Underhill 
(personal communication). 

Within the Mongolian family Moghol is isolated from the rest, and 
Dagur and the Monguor languages are somewhat less distant from the very 
homogeneous Oirat-Khalkha nucleus. The classification adopted in this 
volume (List 4.8) follows Binnick (1981). 


List 4.8. The Mongolian Family List 4.9. The Tungus Family 
(Binnick 1981) (Voegelin & Voegelin 1977) 
MONGOLIAN: TUNGUS: 
I WESTERN: Moghol I NORTHERN 
II EASTERN: II SOUTHERN: 
A Dagur A SOUTHWESTERN 
B MONGUOR B SOUTHEASTERN: 
C OIRAT-KHALKHA: 1 МАМА) 
1 OIRAT-KALMYK 2 UDIHE 


2 KHALKHA-BURIAT 


Тһе Tungus family is divided into distinct Northern and Southern 
branches. Although speakers of Northern languages are found over an 
enormous area covering most of Siberia, the relatively compact Southern 
group displays far greater diversity, being itself bifurcated into Southwest- 
ern and Southeastern groups. The classification followed, shown in List 
4.9, comes from Voegelin and Voegelin (1977). 
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4.2 CHUKCHI-KAMCHATKAN 


Five Chukchi-Kamchatkan languages, with a total of 23,000 speakers, are 
found in northeastern Siberia, primarily on the Chukchi and Kamchatkan 
peninsulas (see Map 4.2). According to Jakobson (1942: 602), the Chukchi- 
Kamchatkan (= Luorawetlan) family was first perceived by S. Krashenin- 
nikov, who connected Chukchi and Koryak in 1775. In 1798 N. Cherepanov 
added Kamchadal to the group. The first serious treatise on these lan- 
guages was published by L. Radloff in 1861, and the first comparative study 
of the family was made by W. Bogoras in 1922. 
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In the twentieth century two additional members of the group have been 
recognized, both of which were previously treated as dialects of Koryak. 
Kerek became isolated from the other Koryak dialects and has undergone 
substantial Chukchi influence. Alyutor has only recently been recognized 
by the Soviet authorities as a language distinct from Koryak. The sub- 
grouping of the family is fairly straightforward. There is a basic, and deep, 
division between a Southern branch, with Kamchadal as the sole member, 
and a Northern branch comprising the other four languages. Within the 
Northern group Chukchi apparently stands somewhat apart from the other 
three (see List 4.10). 


List 4.10. The Chukchi-Kamchatkan Family 


CHUKCHI-KAMCHATKAN: 
I NORTHERN: 
A Chukchi 
B KORYAK: 
1 Kerek 
2 Koryak 
3 Alyutor 
П SOUTHERN: Kamchadal 


The most promising external links for the Chukchi-Kamchatkan family 
appear to lie with the Eskimo-Aleut family and the Gilyak isolate. These 
possibilities are discussed in section 7.3. 
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4.3 DRAVIDIAN 


There are about two dozen Dravidian languages, with a total of perhaps 
145 million speakers. As shown in Map 4.3, they are found predominantly 
in southern India, but there are a few outliers, separated from the main 
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body of Dravidian speakers, in eastern India (Malto, Kurux, Kui) and 
southern Pakistan (Brahui). The extinct language Elamite, formerly spoken 
in what is today southwestern Iran, has also been considered distantly re- 
lated to the Dravidian family. The best-known Dravidian languages are 
Telugu, Tamil, Malayalam, and Kannada. They are the only Dravidian lan- 
guages with literary traditions dating from pre-Christian times, and to- 
gether they account for over 95 percent of the Dravidian population. 


4.3.1 History of Dravidian Classification 


Although grammars of individual Dravidian languages appeared as 
early as the seventeenth century, the family itself was not recognized until 
1816, when a British civil servant stationed in India, Francis W. Ellis, pro- 
posed the genetic affinity of the four literary Dravidian languages (Tamil, 
Malayalam, Kannada, Telugu) and three nonliterary tongues (Tulu, Kodagu, 
Malto). Comparative Dravidian linguistics began, auspiciously, four dec- 
ades later with the publication in 1856 of Robert A. Caldwell's Comparative 
Grammar. Caldwell coined the term Dravidian (based on a Sanskrit word for 
Tamil); added Kota, Toda, Gondi, Kui, Kurux, and Brahui to the seven lan- 
guages enumerated by Ellis; and worked out the rudiments of comparative 
Dravidian linguistics. Caldwell was somewhat unclear regarding the rela- 
tionship of Brahui to the rest of the family. Spoken in what is today south- 
west Pakistan, more than 800 miles (1,280 km) from the nearest Dravidian 
language, Brahui had, according to Caldwell, "a Dravidian element" in its 
vocabulary, but he stopped short of granting it full-fledged membership in 


List 4.11. The Dravidian Family (Konow 1906) 


DRAVIDIAN: 
I NORTH: Brahui 
II CENTRAL AND SOUTH: 
A CENTRAL: 
1 GONDI-TELUGU: Gondi, Kui, 
Kolami, Naiki, Telugu 
2 KURUX-MALTO: Kurux, Malto 
B SOUTH: 
1 TODA-KANNADA: 
a KANNADA 
b TODA-KOTA: Toda, Kota 
2 Kodagu 
3 Tulu 
4 TAMIL-MALAYALAM: Tamil, 
Malayalam 
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List 4.12. The Dravidian Family (Zvelebil 1974) 


DRAVIDIAN: 
I NORTH: 
А Brahui 
B KURUX-MALTO 
II CENTRAL: 
A KOLAMI-PARJI 
В TELUGU-KUI: 
1 Telugu 
2 GONDI-KUI 
III SOUTH: 
А Tulu 
B TAMIL-KANNADA: 
1 KANNADA 
2 TAMIL-TODA: 
a KOTA-TODA 
b TAMIL-KODAGU: 
i Kodagu 
ii TAMIL-MALAYALAM 


the Dravidian family. Nevertheless, by the end of the nineteenth century, it 
had become apparent that Brahui was in fact Dravidian, though perhaps 
the family's most divergent member. This was the position taken in Konow 
(1906: 285), where the Dravidian subgrouping in List 4.11 was proposed. 

The first half of the twentieth century saw little progress in Dravidian 
classification, but during the 1960's contributions by several linguists (in- 
cluding M. B. Emeneau, В. Krishnamurti, and P. 5. Subrahmanyam) led to a 
new view of Dravidian subgrouping containing three primary branches, as 
given in List 4.12. 


4.3.2 Present Status of Dravidian Classification 


In the past decade the initial enthusiasm for the tripartite subgrouping 
of List 4.12 was tempered by a growing disenchantment with certain as- 
pects of the classification. McAlpin (1981: 21), summarizing these develop- 
ments, cites two major defects. First, Brahui and Kurux-Malto do not con- 
stitute a valid genetic group. The evidence supporting this grouping (i.e. 
Emeneau 1962) was scant from the start, and the principal shared innova- 
tion linking the two was shown during the 1970's to be independent, but 
convergent developments. Second, Kolami-Parji and Telugu-Kui do not 
form a valid genetic node; Krishnamurti (1969) suggests that Telugu-Kui 
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List 4.13. The Dravidian Family (Andronov 1978, McAlpin 1981) 


Andronov 1978 McAlpin 1981 
DRAVIDIAN: DRAVIDIAN: 
I Brahui I Brahui 
П DRAVIDIAN PROPER: H DRAVIDIAN PROPER: 
A KURUX-MALTO A KURUX-MALTO 
B CENTRAL AND SOUTH: B CENTRAL AND SOUTH: 
1 GONDI-KUI 1 KOLAMI-PARJI 
2 KOLAMI-TAMIL: 2 TELUGU-KUI 
a KOLAMI-PARJI 3 TAMIL-TULU (= SOUTH 
b TELUGU-TAMIL: DRAVIDIAN) 
i Telugu 
ii TULU-TAMITL: 
о Tulu 
B KANNADA- 
TAMIL 


may be closer to South Dravidian than to Kolami-Parji. Thus, for both 
North and Central Dravidian the requisite shared innovations are lacking. 
There are, according to McAlpin, five secure nodes within the Dravidian 
family: Brahui, Kurux-Malto, Kolami-Parji, Telugu-Kui, and South Dravi- 
dian. Precisely how these five nodes fit together is unresolved; two recent 
proposals are shown in List 4.13. Languages whose precise position in the 
family continues to be controversial are Telugu (formerly believed to be 
most closely related to South Dravidian, but now subgrouped with Gondi- 
Kui), Tulu (apparently standing at some distance from the rest of South 
Dravidian), and a few preliterate languages only recently discovered (e.g. 
Koraga, Kurumba, Bellari). 

A final development in the 1970's saw the revival of the theory that the 
Dravidian family is related to the extinct Elamite language, spoken from 
roughly 2,000 в.с. to 500 в.с. in what is today southwest Iran. This hy- 
pothesis was first put forth in the mid-nineteenth century by several differ- 
ent scholars, including Caldwell. During the 1970's the hypothesis was re- 
discovered and elaborated by McAlpin; for a detailed summary of these 
developments, consult McAlpin (1981). 

The Dravidian classification adopted in this volume (List 4.14) is a slightly 
modified version of McAlpin's subgrouping, in which Central Dravidian 
has been retained as a genetic unit, but North Dravidian has not. 

For a discussion of proposed genetic links between Dravidian and other 
language families, see section 7.3 below. 
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List 4.14. Тһе Elamo-Dravidian Family 
(McAlpin 1981 and others) 


ELAMO-DRAVIDIAN: 
I tElamite 
II DRAVIDIAN: 
A NORTHWEST: Brahui 
B DRAVIDIAN PROPER: 
1 NORTHEAST (- KURUK-MALTO) 
2 CENTRAL: 
a KOLAMI-PARJI 
b TELUGU-KUI 
3 SOUTH: 
a Tulu 
b TAMIL-KANNADA 


Andronov, M. S. 1965a. Dravidijskie jazyki. Moscow. 

. 1965b. "Lexicostatistic Analysis of the Chronology of Disintegration of 

Proto-Dravidian," Indo-Iranian Journal 7: 170—86. 

. 1978. Sravnitel’naja grammatika dravidijskix jazykov. Moscow. 

Caldwell, Robert A. 1856. A Comparative Grammar of the Dravidian or South-Indian 
Family of Languages. London. 

Emeneau, Murray B. 1962. Brahui and Dravidian Comparative Grammar. UCPL 27. 

. 1969. "The Non-Literary Dravidian Languages," in CTIL 5. 

Konow, Sten. 1906. Munda and Dravidian Languages, vol. 4 of George A. Grierson, 
ed., Linguistic Survey of India. Delhi. 

Krishnamurti, B. 1969. "Comparative Dravidian Studies," in CTIL 5. 

McAlpin, David W. 1974. “Toward Proto-Elamo-Dravidian," Language 50: 89-101. 

. 1975. “Elamite and Dravidian: Further Evidence of Relationship," Current 

Anthropology 16: 105—15. 

. 1980. "Is Brahui Really Dravidian?" Proceedings of the Berkeley Linguistic So- 

ciety 6: 66-72. 

. 1981. Proto-Elamo-Dravidian: Тһе Evidence апа Its Implications. Philadelphia. 

Subrahmanyam, P. S. 1969. “Тһе Central Dravidian Languages," іп S. Agesthialin- 
gom апа М. Kumaraswami Raja, eds., Dravidian Linguistics. Annamalainagar. 

Zvelebil, Kamil V. 1970. Comparative Dravidian Phonology. The Hague. 

. 1974. “Dravidian Languages,” іп ЕВ 5. 

. 1977. A Sketch of Comparative Dravidian Morphology 1. The Hague. 


























4.4 SINO-TIBETAN 


After Indo-Hittite, the Sino-Tibetan family is the most populous lan- 
guage family in the world, with close to a billion speakers. Sino-Tibetan 
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languages are found primarily in China, India, Nepal, and Burma. To a 
lesser degree they are also found in the northern parts of Thailand, Laos, 
and Vietnam. Their geographic distribution is shown in Map 4.4. Except 
for Chinese, however, few of the languages have many speakers. Estimates 
for the number of speakers of each of the Chinese "dialects" (really separate 
languages) are difficult to come by. By my estimate there are currently some 
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927 million total Chinese speakers for the eight languages into which the 
Chinese continuum has been (somewhat arbitrarily) divided, with а distri- 
bution roughly as follows: Mandarin (680), Wu (69), Yue (= Cantonese, 
53), Southern Min (36), Xiang (29), Kejia (28), Gan (21), Northern Min (11). 
These figures may be too low. Except for Chinese, however, few Sino- 
Tibetan languages have many speakers or are well known. Burmese, with 
some 22 million speakers, and Tibetan, with four million, are the only ex- 
ceptions. The Hani language, spoken in southern China by more than one 
million people, is the only other Sino-Tibetan language with a million 
speakers, but it is hardly well known. Most Sino-Tibetan languages are 
spoken by small numbers of people, and their names are known only to 
specialists. 


4.4.1 History of Sino-Tibetan Classification 


In the sources I have been able to consult, the early history of the Sino- 
Tibetan family remains regrettably vague. What is clear is that during the 
nineteenth century there first appears in the linguistic literature an Indo- 
Chinese family, defined for the most part typologically by traits like tone 
and monosyllabism. Its membership varied somewhat from scholar to 
scholar but followed the general pattern shown in List 4.15. Most scholars 
excluded Mon-Khmer from this family, and the affinity of the small Miao- 
Yao group was also problematical. The Tai languages appeared to have a 
special relationship with Chinese. According to Konow (1909: 12), the large 
and diverse Tibeto-Burman family was first recognized in 1828 by B. H. 
Hodgson, and the first Tibeto-Burman classification was attempted by Max 
Müller in 1854. At the end of the century, August Conrady proposed a fam- 
ily uniting Chinese, Tai, and Tibeto-Burman (List 4.16). 

In the early twentieth century Konow (1909: 1), among others, noted that 
the subfamilies lumped together under the rubric Indo-Chinese belonged 
to two distinct families: (1) Mon-Khmer and (2) Chinese, Tai, and Tibeto- 


List 4.15. The Indo-Chinese Family List 4.16. The Indo-Chinese Family 
(19th century) (Conrady 1896) 
INDO-CHINESE: INDO-CHINESE: 
I CHINESE I SINITIC (= CHINESE) 
II TAI II TAI 
III KAREN III TIBETO-BURMAN 
IV TIBETO-BURMAN 
V MIAO-YAO 


?VI MON-KHMER 
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List 4.17. Proposed Subgroupings of the Sino-Tibetan Family 


Konow 1909 Li 1937 
SINO-TIBETAN: SINO-TIBETAN: 
I SINO-TAI: I SINO-TAI: 
A SINITIC A SINITIC 
B TAI B TAI 
II TIBETO-BURMAN C MIAO-YAO 
II TIBETO-BURMAN 
Benedict 1942 Shafer 1955 
SINO-TIBETAN: SINO-TIBETAN: 
I SINITIC I SINITIC 
П TIBETO-KAREN: > TAI 
A KAREN III BODIC 
B TIBETO-BURMAN IV BURMIC 
V BARIC 
VI KAREN 


Burman. For the latter family Konow proposed the subgrouping shown in 
List 4.17. Fang-kuei Li presented a similar classification in 1937, though he 
included Miao-Yao (List 4.17). 

Although a few scholars were skeptical of the Chinese-Tai connection, it 
was largely accepted until 1942, when a young American linguist, Paul K. 
Benedict, proposed that the nearest affiliation of the Tai group was with the 
Austronesian family and not with Sino-Tibetan. In fact a similar opinion 
had been offered by Gustav Schlegel as early as 1902. Whether the roots 
shared by Chinese and Tai are merely early loans, as Benedict maintains, or 
represent the residue of a common origin, as suggested by Manomaivibool 
(1976) and others, continues to divide the linguistic community. Further- 
more, even if these roots are loans, the direction of borrowing is often in 
dispute. Nevertheless, whether or not Chinese and Tai are genetically re- 
lated at some level, it would appear that Tai's closest relatives are the Aus- 
tronesian languages. 

For Benedict (1942: 600) the Sino-Tibetan family was constituted as 
shown in List 4.17. He explicitly excluded Miao-Yao from Sino-Tibetan, 
linking it tentatively with Tai, Austronesian, and (until 1966) Austroasiatic 
in an Austric phylum (see section 4.5). 

In 1955 Robert Shafer presented a classification of Sino-Tibetan in which 
both the Sino-Tai and Tibeto-Burman nodes were discounted. In addition, 
Shafer expressed reservations about the inclusion of Tai (see List 4.17). 
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4.4.2 Present Status of Sino-Tibetan Classification 


Renewed interest іп classificatory problems during the late 19605 and the 
19705 has led to а growing consensus that Sino-Tibetan is made up of just 
three subfamilies: Sinitic, Karen, and Tibeto-Burman. Those who would 
also include Tai or Miao-Yao appear to Бе in a decreasing minority; Benedict 
(1976: 172) excludes both categorically: “The real problem . . . has always 
been why anyone . . . has ever seriously taken the Kam-Tai and/or M[iao]- 
Y[ao] languages to be true ‘blood cousins’ of S[ino]-T[ibetan], given the al- 
most total lack of any basic ties in the respective lexicons." 

There is agreement that Karen is closer to Tibeto-Burman than to Sinitic 
within Sino-Tibetan, but whether it is taxonomically coordinate with Tibeto- 
Burman, as in Benedict's proposal (see List 4.17), or simply a subgroup 
thereof is unresolved. Of the three constituents that make up Sino-Tibetan, 
both Sinitic and Karen are small, very close-knit families whose genetic 
unity is clear. Traditionally Sinitic and Chinese have been considered syno- 
nyms. However, Greenberg (1953) remarks, concerning the Bai (= Minchia) 
language(s), that “when the obvious Chinese borrowings are accounted for, 
the language still appears to show a special affinity to Chinese in funda- 
mentals, so that it should probably be included in the Sinitic subbranch.” 
His suggestion was generally ignored, and Voegelin and Voegelin (1977: 90) 
classified Bai as a member of Burmese-Lolo within Tibeto-Burman. Accord- 
ing to Benedict (1983), new material on the Bai language(s) indicates that 
Greenberg was correct, and that Sinitic consists of two branches, Chinese 
and Bai. The third constituent, Tibeto-Burman, is large and more diverse, 
but still well defined, though the precise relationship of Karen to the Tibeto- 
Burman nucleus remains a problem. The internal structure of Tibeto- 
Burman is not at all clear, as the various subgrouping proposals in List 4.18 
suggest. 

The Sino-Tibetan classification adopted in this volume (List 4.19) follows 
Benedict (1942, 1972, 1983) for the overall subgrouping, but the Tibeto- 
Burman family is based primarily on Shafer (1955, 1974). The classification 
is thus similar to Egerod's (1974). In addition, certain modifications have 
been made on the basis of Matisoff (1980) and Bradley (1976). The Tibeto- 
Burman subgrouping should be considered provisional, for it is certain that 
future research will necessitate substantial revision. For a comprehensive 
review of different Tibeto-Burman subgroupings, consult Hale (1982). 

Let us close this section by examining a curious development in the schol- 
arly appraisal of Sino-Tibetan during the twentieth century. For Konow 
(1909: 2), Tibeto-Burman was "very closely related" to Sinitic. Three dec- 
ades later Benedict (in a work not published until 1972) termed this same 
relationship "a distant one" (p. 2). Recently Lehman (1975: 218) implied 


List 4.18. Proposed Subgroupings of the Tibeto-Burman Family 


Konow 1909 Shafer 1955 
TIBETO-BURMAN: TIBETO-BURMAN:" 
I TIBETAN I TIBETIC 
II HIMALAYAN II BARIC 
III NORTH ASSAM III BURMIC 
IV BODO 
V NAGA 
VI Kachin 
VII KUKI-CHIN 
VIII BURMESE 
Voegelin & Voegelin 1977 


Benedict 1972 


TIBETO-BURMAN: 
I TIBETAN-KANAURI I TIBETAN 
II BAHING-VAYU II GYARUNG-MISHMI 
III ABOR-MIRI-DAFLA Ш BURMESE-LOLO 
IV BURMESE-LOLO IV NAGA-KUKI-CHIN 
V Kachin V BODO-NAGA-KACHIN 
VI KUKI-NAGA VI KAREN 
VII BARIC 
"Shafer did not posit a TIBETO-BURMAN node, but these three 
groups are what remains after one excludes SINITIC, TAI, and 
KAREN from his Sino-Tibetan family. 


TIBETO-BURMAN: 


List 4.19. The Sino-Tibetan Family (Benedict 1942, 
Shafer 1955, and others) 


SINO-TIBETAN: 
I SINITIC: 
A BAI 
B CHINESE 
II TIBETO-KAREN: 


A KAREN 
B TIBETO-BURMAN: 


1 TIBETIC 
2 BARIC 
3 BURMIC: 
a KUKI-NAGA 
b KACHIN-LUIC 
c BURMESE-MOSO: 
i MOSO 
ii BURMESE-LOLO: 
а BURMIC 
B LOLO 
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that the Sino-Tibetan family is as yet unproved: “While І fully subscribe to 
the view that actual proof of genetic relationship is founded upon explicit 
Lautlehren [sound laws], I believe that B[enedict] has all but proved the 
S[ino]-T[ibetan] hypothesis. He has listed so many fairly obvious cognate 
sets... that their cumulative effect alone precludes serious doubt." 

How can it be that the dramatic increase in both the quantity and quality 
of Sino-Tibetan sources during this century seems to have had the effect of 
eroding confidence in the Sino-Tibetan family? Have these new materials 
really called into question the validity of Sino-Tibetan? Of course not. What 
has happened in the study of Sino-Tibetan parallels what happened in Af- 
rican classification: the standard of proof of genetic affinity has been modi- 
fied to such an extent by the requirement of regular sound correspon- 
dences that even such obvious groupings as Sino-Tibetan fail to meet the 
new requirements. The striking similarity between the quotation from 
Lehman above and the one from Welmers on African languages (page 121) 
illustrates the pervasiveness of the current malaise caused by the insistence 
on sound correspondences as proof of genetic affinity. Further discussion 
of this issue may be found in section 3.6. 
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4.5 AUSTRIC: MIAO-YAO, AUSTROASIATIC, DAIC 


The Austric phylum is a controversial grouping that consolidates four 
families: Miao-Yao, Austroasiatic, Daic (= Kadai), and Austronesian. Alto- 
gether the Austric phylum contains close to 1,200 languages, with a total of 
around 293 million speakers. In this section we will discuss the Austric hy- 
pothesis and its three Asiatic subfamilies, Miao-Yao, Austroasiatic, and 
Daic (see Map 4.5). The Austronesian family is dealt with in section 5.1. 

Four Miao-Yao languages are spoken in scattered "islands" throughout 
southern China (mainly Guizhou Province) and the northern regions of 
Vietnam, Laos, and Thailand. The Miao language is composed of several 
divergent dialects with perhaps a total of five million speakers. Mien (= Yao) 
has about one million. 

The Austroasiatic family has approximately 150 languages and 56 million 
speakers, unequally divided into two important subfamilies, Munda and 
Mon-Khmer. Ten Munda languages are spoken in northeastern India by six 
million people; Santali (4 million) and Mundari (2 million) account for most 
of the speakers. Mon-Khmer languages are spoken throughout much of 
Southeast Asia, primarily in Laos, Vietnam, and Kampuchea, but also in 
Thailand, Burma, the Malay Peninsula, and the Nicobar Islands in the An- 
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Map 4.5. The Miao-Yao, Austroasiatic, and Daic families 


daman Sea. One Mon-Khmer language (Khasi) is found in northeast India. 
There are about 140 Mon-Khmer languages, with a total of 50 million 
speakers, but only Vietnamese (47 million) and Khmer (= Cambodian, 
7 million) are well known. 

The Daic family has around 60 languages, which are spoken by an esti- 
mated 50 million people in Thailand, Laos, northern Vietnam, Burma, and 
southern China, and also on Hainan Island in the Gulf of Tonkin. The best- 
known languages are Thai (= Siamese, 30 million speakers) and Lao 
(= Laotian, 17 million). Other Daic languages with more than a million 
speakers are Zhuang (13 million), Northern Tai (6 million), Southern Tai 
(4 million), Bouyei (2 million), Shan (2 million), Zhongjia (2 million), Kam 
(1.5 million), and Li (1 million). 


4.5.1 History of Austric Classification 


The linguistic relationships within Southeast Asia remained obscure un- 
til the second half of the nineteenth century, when the genetic picture 
slowly began to be delineated. The process was gradual; only after roughly 
a century of work (c. 1850—1950) were the basic outlines clear, and indeed 
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even today there are guestions that have not been fully resolved. The his- 
tory of Austric is intertwined with that of Sino-Tibetan in respect to two 
families, Tai and Miao-Yao, which have been assigned to Sino-Tibetan by 
some linguists and to Austric by others. The borderline between the well- 
defined Sino-Tibetan family and the controversial Austric phylum has con- 
seguently fluctuated over the years, with a growing consensus that both Tai 
and Miao-Yao either are independent families or belong to Austric rather 
than to Sino-Tibetan. Fig. 4.1 compares the differing subgroups of the 
major Southeast Asian language families as conceived today and at the start 
of this century. 

Until the 18505 the Малда languages were thought to belong to the 
Dravidian family, since speakers of both groups were of the same racial 
type. In 1854, however, Мах Muller separated the Munda languages from 


c. 1900 c. 1980 
TIBETO-BURMAN 


KAREN 
SINO-TIBETAN 


SINO-TIBETAN 
SINITIC 


MIAO-YAO 





TAI 


AUSTROASIATIC АКЫНЫ 


МОМ-КНМЕК 


MUNDA 


Fig. 4.1. Subgrouping of Southeast Asian families, 1900 and 1980 
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Dravidian and coined the term Munda for the family. At about the same 
time J. R. Logan and F. Mason pointed out similarities between Munda lan- 
guages and languages farther east belonging to what is now called the 
Mon-Khmer family; Logan also called attention to resemblances between 
Khasi, spoken in India, and Mon-Khmer. The larger grouping, comprising 
Mon-Khmer and Munda, was dismissed by scholars such as James M. 
Hasswell and C. J. F. S. Forbes during the latter part of the nineteenth cen- 
tury. On the other hand, Ernst Kuhn presented evidence in 1889 substan- 
tiating the link between the Munda family and Mon, Khmer, the languages 
of Malacca, and those of the Nicobar Islands, though he hesitated to callita 
genetic connection. He referred instead to a “common substratum.” Dur- 
ing the twentieth century the grouping that comprises Munda and Mon- 
Khmer has become known as Austroasiatic and its validity as a genetic 
node is now well established. 

In 1906 the Austrian anthropologist Wilhelm Schmidt proposed that the 
Austroasiatic family (= Munda + Mon-Khmer) and the Austronesian fam- 
ily (formerly known as Malayo-Polynesian and covering the greater part of 
Oceania; see section 5.1) together constitute an even larger family, which 
he named Austric (see List 4.20). Schmidt cited striking correspondences 
in prefixes and infixes (the latter being a fairly rare phenomenon in hu- 
man language), as well as certain lexical agreements, to substantiate his 
argument. 

The Austric hypothesis has been controversial from the start and is no 
less so today. Indeed, given the fragmentary nature of the evidence and the 
generally hostile attitude toward proposals of this type, it could hardly 
have been otherwise. From today's perspective it is clear that Schmidt's clas- 
sification contained several errors: (1) Chamic and Jakun are misclassified 
as Austroasiatic; both are members of Austronesian; (2) the Viet-Muong 


List 4.20. The Austric Family (Schmidt 1926) 


AUSTRIC:’ 
I AUSTROASIATIC: [1 MON-KHMER] 
A ASLIAN b Khmer 
B CENTRAL: c BAHNARIC 
1 Khasi а JAKUN 
2 NICOBAR ISLANDS 2 MUNDA 
3 PALAUNGIC D CHAMIC 


C SOUTHEAST AND П AUSTRONESIAN: 





NORTHWEST: A INDONESIAN 
1 MON-KHMER: B OCEANIC 
a MONIC 


“Іп this list, as throughout this volume, modern names have been substituted for some of 
the original terminology, e.g. Malaccan — Aslian, Southeast Mixed — Chamic. 
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group was incorrectly classified as Sino-Tibetan; it is now known to be a 
member of Mon-Khmer; and (3) the Tai and Miao-Yao groups were also in- 
correctly placed in Sino-Tibetan. As we shall see directly, the Tai languages 
are closest genetically to the Austronesian family, and the Miao-Yao lan- 
guages appear to belong to the Austric complex, or at the very least to 
Austro-Tai. 

In the early 1940's a young American linguist, Paul K. Benedict, recog- 
nized a small family of languages in southern China that had obvious af- 
finities with the Tai family. He named this group the Kadai family. Further- 
more, he noticed that both this Kadai group and the Tai family appeared to 
share more cognates with the Austronesian family than with the Sinitic 
branch of Sino-Tibetan, with which they were usually associated. An exami- 
nation of the supposed Sinitic-Tai cognates revealed that the words in ques- 
tion (e.g. numbers, cultural items) were almost certainly loans, and that the 
basic vocabulary (e.g. body parts, pronouns) indicated an unmistakable 
Austronesian affiliation. Benedict published his findings in 1942 and simul- 
taneously clarified both Sino-Tibetan and Austric classification. Benedict 
only tentatively included Miao-Yao in Austric at this point (see List 4.21). 

As Benedict himself noted, the Austro-Tai hypothesis had been fore- 
shadowed in studies published a half century or so earlier. In 1887 Terrien 
de Lacouperie had suggested a relationship between Li (a Kadai language) 
and Formosan languages (which belong to Austronesian). Two years later 
E. H. Parker had recognized similarities between Li and Tai languages, and 
in 1902 Gustav Schlegel called attention to similarities between Tai and 
Malay (an Austronesian language). No one, however, had systematically 
surveyed the available evidence, and drawn the obvious conclusions, until 
Benedict (1942). Even though his Austro-Tai proposal flew in the face of es- 
tablished dogma, it gained support in the 1950's (e.g. Greenberg 1953). 


List 4.21. The Austric Family 
(Benedict 1942) 


AUSTRIC: 

?I MIAO-YAO: 
A MIAO 
B YAO 

II AUSTROASIATIC: 
A MON-KHMER 
B VIET-MUONG 
C MUNDA 

III AUSTRO-TAL 
A TAI 
B KADAI 
C AUSTRONESIAN 
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Nonetheless, some linguists still cling to the traditional view (i.e. that Tai is 
closest to Chinese). 

Schmidt's Austric hypothesis, however, has been less successful. Sebeok 
(1942) reviewed it negatively, and Greenberg (1953) concluded that "the 
Austric hypothesis cannot be accepted on present evidence." As for Miao- 
Yao, Greenberg (1953) found it to be "a separate branch of Sino-Tibetan, no 
more divergent than, say, the Karen languages of Burma." 

After leaving the linguistic scene for some two decades to pursue a ca- 
reer in psychiatry, Benedict returned to the problem of Austro-Tai in the 
mid-1960's and increased the number of putative Austro-Tai roots from 30 to 
around 400. The Austro-Tai hypothesis was strengthened by material from 
the Kam-Sui group, first recognized by Fang-kuei Li in the early 1940s. Like 
the Kadai family, Kam-Sui is a small ^Tai-like" group of languages display- 
ing affinities with both Tai and Austronesian languages. Benedict consider- 
ably bolstered the case for Austro-Tai, but he retreated in his support of 
Austric, reinterpreting the connection between Austroasiatic and Aus- 
tronesian as a "substratum relationship" rather than a true genetic bond. 
Miao-Yao was treated as a separate entity “with Austroasiatic. . . the most 
likely affiliation" (Benedict 1966—67: 327). 

In 1975 Benedict published his collected writings on Austro-Tai, includ- 
ing a comprehensive glossary of Austro-Tai roots. In this work he included 
the Miao-Yao group in an expanded version of the Austro-Tai family, as 
shown in List 4.22. A connection between this expanded Austro-Tai stock 
and the Austroasiatic family (i.e. the Austric hypothesis) was still denied, 
though in rather equivocal terms: "There are indications of a very early link- 
age (less than a full genetic relationship) between A[ustro]-T[ai] and the 
Austroasiatic . . . languages. Our present analysis confirms this linkage. 
[However,] we must still think in terms of a “linkage” or 'substratum' rather 
than of a fully reconstructable language stock" (Benedict 1975: 135). 

Like Benedict, Greenberg revised his classification of these language 
families, coming to feel that both his earlier inclusion of Miao-Yao in Sino- 
Tibetan and his dismissal of Austric were incorrect. By 1980 he viewed the 
Austric phylum roughly as we see it in List 4.23. 


List 4.22. The Austro-Tai Family List 4.23. The Austric Family 
(Benedict 1975) (Greenberg 1980) 
AUSTRO-TAI: AUSTRIC: 
I MIAO-YAO I MIAO-YAO 
П TAI-AUSTRONESIAN: П AUSTROASIATIC 
А DAIC Ш AUSTRO-TAI: 
B AUSTRONESIAN A DAIC 


B AUSTRONESIAN 
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Fig. 4.2. Different definitions of Austric 


4.5.2 Present Status of Austric Classification 


There is, as yet, no consensus on the validity of the Austric phylum, 
with strong opinions on both sides of the issue. Furthermore, even among 
those who support an Austric grouping there are differences on which 
families are to be included, as Fig. 4.2 illustrates. For the overall subgroup- 
ing of Austric, I have adopted Benedict's (1942) position (List 4.21, above). 

The internal subgrouping of the four constituents of Austric differs in 
the degree of clarity attained. The relationships are fairly straightforward 
for Miao-Yao, less so for Austroasiatic and Daic, and quite complex and un- 
settled for Austronesian (see section 5.1). The Miao-Yao family is bifurcated 
into the Miao language as one group and the small Yao (-- Mien) family as 
the other. According to Benedict (1983), Daic apparently has two branches, 
Lati-Gelao and Li-Kam-Tai, with the latter group containing the Kam-Sui 
and Tai families.” 

The Austroasiatic family is larger and more diverse than either Miao-Yao 
or Daic, and no definitive subgrouping for the family has so far been estab- 
lished. List 4.24 illustrates five proposals, by Schmidt (1926), Sebeok (1942), 


*It is customary to use the spelling Tai for the group of languages to which Thai (=Siamese= 
Standard Thai) belongs 


List 4.24. Proposed Subgroupings of the Austroasiatic Family 


Schmidt 1926 


AUSTROASIATIC: 
I ASLIAN 
П CENTRAL: 
A. Khasi 
B PALAUNGIC 
C NICOBAR ISLANDS 
III MON-KHMER-MUNDA: 
A MON-KHMER: 
1 MONIC 
2 Khmer 
3 BAHNARIC 
4 JAKUN 
B MUNDA 
IV CHAMIC 


Pinnow 1959, 1963 


AUSTROASIATIC: 
I WESTERN: 
A Nahali 
B MUNDA 
H EASTERN: 
A NICOBAR ISLANDS 
B PALAUNG-KHMER: 
1 Khasi 
2 PALAUNGIC 
3 MON-KHMER: 
a MONIC 
b Khmer 
c BAHNARIC 
4 ASLIAN 


Voegelin & Voegelin 1977 
AUSTROASIATIC: 


I ASLIAN (including JAKUN) 


II MON-KHMER 
Ш MUNDA 
IV NICOBAR ISLANDS 


Sebeok 1942 


AUSTROASIATIC: 
I MUNDA 
II MON-KHMER: 
A MON-KHMER PROPER: 
1 MONIC 
2 Khmer 
3 BAHNARIC 
4 PEARIC 
B CHAMIC 
C ASLIAN 
D NICOBAR ISLANDS 
E PALAUNGIC-KHMUIC 
F Khasi 


Diffloth 1974 


AUSTROASIATIC: 
I MUNDA 
II NICOBAR ISLANDS 
Ш MON-KHMER: 
A Khasi 
B PALAUNGIC 
C MONIC 
D KHMUIC 
E VIET-MUONG 
F KATUIC 
G BAHNARIC 
H PEARIC 
I Khmer 
J SEMANG 
K SAKAI 
L SEMELAIC 
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Ріппоуу (1959, 1963), Diffloth (1974), апа Voegelin and Voegelin (1977). It 
now appears that а dozen different subfamilies constitute Austroasiatic: 


(14) MUNDA BAHNARIC 
Khasi Khmer 
PALAUNGIC РЕАКІС 
KHMUIC MONIC 
VIET-MUONG  ASLIAN 
KATUIC NICOBAR ISLANDS 


The Austroasiatic subgrouping adopted in this volume represents Gérard 
Diffloth's current appraisal, which he has been kind enough to share with 
me. Diffloth divides Austroasiatic into two branches; the Munda family 
constitutes one branch, and the remaining 11 families constitute the second 
branch, commonly known as Mon-Khmer. The Munda family consists of 
Northern and Southern branches. The Munda classification followed in this 
volume is based on suggestions from David Stampe and Norman Zide. 
Mon-Khmer, according to Diffloth, has three branches, North, East, and 
South, as shown in List 4.25. This Mon-Khmer subgrouping should be re- 
garded as tentative and will likely undergo further modification as research 
on Austroasiatic classification proceeds. 


List 4.25. The Austroasiatic Family (Diffloth 1982) 


AUSTROASIATIC: B EAST: 
I MUNDA 1 KATUIC 
II MON-KHMER: 2 BAHNARIC 
А NORTH: 3 Khmer 

1 Khasi 4 PEARIC 

2 PALAUNGIC-KHMUIC C SOUTH: 

3 VIET-MUONG 1 MONIC 

2 ASLIAN 


3 NICOBAR ISLANDS 
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Oceania 


Close to 40 percent of the world's languages are spoken on the islands of the 
Pacific and Indian oceans. With perhaps a few exceptions, they belong to 
one of three families: Austronesian (formerly Malayo-Polynesian), Indo- 
Pacific (or Papuan), or Australian. The Austronesian family is the most wide- 
spread and the most numerous, but also the most close-knit of the three. 
Austronesian languages are spoken on practically every island from Mada- 
gascar to Easter Island, and from Formosa to New Zealand. Yet despite this 
enormous geographic distribution, their genetic unity is transparent. Lan- 
guages belonging to the Indo-Pacific phylum are concentrated on New 
Guinea and surrounding islands, often in propinquity to Austronesian 
tongues. The languages belonging to Indo-Pacific display a bewildering di- 
versity, and their genetic unity is still not accepted by many scholars. One 
of the few "islands" of the South Pacific that Austronesian speakers never 
inhabited, so far as we know, is the largest, Australia. The Australian family 
is coterminous with Australia itself. АП Australian languages are spoken in 
Australia and, before the English invasion at least, only Australian lan- 
guages were found there. The validity of the Australian family is by now 
both well established and widely accepted. It represents nonetheless an an- 
cient grouping whose genetic unity is by no means obvious. 
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5.1 AUSTRIC: AUSTRONESIAN 


The alleged genetic connections between the Austronesian family and 
the Miao-Yao, Austroasiatic, and Daic families of Southeast Asia were dis- 
cussed in section 4.5. In this section we will examine the Austronesian fam- 
ily, which extends from Madagascar, off the east coast of Africa, through 
Indonesia and the Philippines and across the Pacific Ocean to Hawaii and 
Easter Island. The geographic distribution of the Austronesian family is 
shown in Map 5.1. 

Austronesian comprises close to 1,000 languages with an estimated 180 
million speakers. If we draw a line separating New Guinea and islands to 
the east of it from those to the west (most of Indonesia, the Philippines, 
Formosa, Madagascar), there are roughly equal numbers of languages on 
either side of the line, but almost all the speakers are in the western half. 
Even though close to 500 Austronesian languages are spoken on New 
Guinea and islands farther east, they probably claim fewer than two million 
speakers, compared with an estimated 179 million Austronesian speakers 
in the western half (primarily in Indonesia and the Philippines). 

Only two Austronesian languages outside the Philippines and Indonesia 
have more than a million speakers. Malagasy, on the island of Madagascar, 
has 8 million, and Malay has 10 rnillion in Malaysia, with perhaps 15 mil- 
lion speakers of various dialects (some extremely divergent) elsewhere. 
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Map 5.1. The Austronesian family 
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А form of Malay, known ав Bahasa Indonesia (= Indonesian), has been 
adopted as the national language of Indonesia, where it is spoken by per- 
haps 10 million people as their first language, and by tens of millions more 
as a second language. Indonesia has no fewer than 12 Austronesian lan- 
guages with a million or more speakers: Javanese (50, plus 25 million who 
speak it as a second language), Sundanese (25), Madurese (8), Minangkabau 
(7), Bali (3), Bugis (= Buginese, 2.5), Makassarese (2), Achinese (2), Toba 
Batak (2), Sasak (1.5), Lampung (1.5), and Rejang (1). In the Philippines 
there are eight languages with a million or more speakers: Cebuano (10), 
Tagalog (10), Посапо (5), Hiligaynon (4), Bikol (3), Samar-Leyte (2), Kapam- 
pangan (1.5), and Pangasinan (1). The Philippine national language (Pili- 
pino) is based on Tagalog. Several Austronesian languages are spoken on 
the island of Formosa, where they have been largely submerged by the in- 
fluence of Chinese immigration during the last several centuries. The nu- 
merically strongest language, Ami, has around 100,000 speakers, and most 
others have only a few thousand or less. Several are already extinct. Some 
Austronesian languages of Polynesia are fairly well known despite having 
relatively few speakers (e.g. Samoan, Tahitian, Hawaiian). 


5.1.1 History of Austronesian Classification 


As early as 1603 a Dutchman, Frederick de Houtman, perceived the rela- 
tionship between Malay and Malagasy. A little over a century later another 
Dutchman, Hadrian Reland (1706), recognized similarities among the lan- 
guages of Polynesia, the East Indies, and Madagascar. In 1778 the Polyne- 
sian family was correctly identified by a member of Cook's second voyage 
around the world (1772-75), John Reinold Forster. Given the very close- 
knit nature of the Polynesian family, this early discovery is not surprising. 
Like Bantu, the Polynesian languages are sometimes described as dialects 
of a single language. In 1782 William Marsden noted similarities between 
the languages of Madagascar and Easter Island. Despite these early pre- 
cursors, both Dyen (1971a) and Collins (1980) credit Lorenzo Hervás y 
Panduro with having "definitely established" the Austronesian family in 
1784, two years before William Jones's famous pronouncement on Indo- 
European. 

The Austronesian affinity of the aboriginal languages of Formosa was 
first noticed by J. H. Klaproth in 1822. Because of the present domination of 
Formosa by the Chinese, it is often forgotten that until the initial Chinese 
invasion of 1626, Formosa was inhabited solely by Austronesian speakers. 
In a work published from 1836 to 1839, Wilhelm von Humboldt provided 
sketches of several Polynesian and Indonesian languages and proposed the 
name Malayo-Polynesian for the family. 

Lying between the Indonesian languages and the Polynesian languages 


162 5 OCEANIA 


was а large group that came to be known ав Melanesian. They were at first 
not regarded as members оҒ Malayo-Polynesian; their membership in the 
family came to be accepted only gradually during the second half of the 
nineteenth century. Hans Conon von der Gabelentz published grammatical 
sketches of various Melanesian languages in 1861, expressing the view that 
the Melanesian languages constitute a valid linguistic family and that, in 
addition, the Melanesian and Polynesian languages share numerous simi- 
larities that cannot be attributed to borrowing. In linking the Melanesian 
and Polynesian groups in this manner, Gabelentz foreshadowed what has 
become known as the Oceanic subgroup of Austronesian. 

In 1885 R. H. Codrington published a book on the Melanesian languages 
and emphatically underlined their unmistakable Malayo-Polynesian af- 
finity. He attributed the reluctance of some scholars toinclude the Melanesian 
languages in Malayo-Polynesian to their speakers’ “misfortune to be black” 
(quoted in Grace 1959: 5). Just as in Africa, where the Chadic family was 
long excluded from Afro-Asiatic because Chadic speakers are black and 
the rest of the family is not (see section 3.2), so too in Oceania were 
the Melanesian speakers at first denied their proper place in the Malayo- 
Polynesian family solely because of their race. Nineteenth-century scholars 
expected congruity between race and linguistic group, and those cases that 
did not meet their expectations were often handled poorly. 

Although the Oceanic subgroup of Austronesian had been adumbrated 
in the work of Gabelentz, it was explicitly recognized by Hendrik A. Kern 
in 1886. Furthermore, Kern demonstrated that Fijian, which had previ- 
ously been considered a Melanesian language because its speakers are 
dark-skinned, is in fact most closely related to the languages of the light- 
skinned Polynesians. 

Not until the final decade of the nineteenth century was it discovered 
that not all of the Melanesian languages were Malayo-Polynesian. Sidney 
Ray reported non-Malayo-Polynesian languages in Southeast New Guinea 
in 1892, and a decade later Wilhelm Schmidt demonstrated their presence 
in Northeast New Guinea as well, Ray proposed the term Papuan for these 
non-Malayo-Polynesian languages, restricting “Melanesian” to Malayo- 
Polynesian tongues. The classification of the Papuan languages of New 
Guinea and surrounding islands is discussed in section 5.2 below. 

Schmidt (1899a) proposed the name Austronesian in place of Malayo- 
Polynesian. During the twentieth century Austronesian has become the 
most widely used term for the family as a whole, and Malayo-Polynesian 
has been given a more restricted definition, or eliminated altogether. In an- 
other study published the same year, Schmidt (1899b) concluded that (1) 
the Melanesian languages were definitely Austronesian; (2) the Melanesian 
group is older than the Polynesian group, in the sense that the original 
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Melanesian language was spoken much earlier than the original Polynesian 
language; and (3) the languages most closely related to the Polynesian fam- 
ily are Fijian, Rotuman, the languages of the southern Solomons, and, to а 
lesser degree, those of the central New Hebrides. 

By the beginning of the twentieth century, practically all of the Aus- 
tronesian-speaking areas of the world had been identified. For taxonomists 
the problem of delimiting the Austronesian domain was replaced by the 
more difficult problem of unraveling the internal relationships of the myr- 
iad Austronesian languages. (A second—and during the first half of the 
twentieth century more important—focus of Austronesian research was 
the reconstruction of Proto-Austronesian, often attempted with remark- 
able success.) At the turn of the century, the subgrouping of the Austrone- 
sian family was viewed as follows. First, the Polynesian family constituted 
one clear-cut group, and together with the Melanesian languages formed a 
higher-level node called Oceanic. Second, the remainder of the Austrone- 
sian family, that is, the non-Oceanic languages, were lumped together 
under the rubric Indonesian. And third, it was realized that some of the 
Melanesian languages were in fact genetically closer to the Polynesian fam- 
ily than to the other Melanesian languages. Schmidt (1926) called such lan- 
guages "transitional" Melanesian, implying that they were transitional be- 
tween Melanesian languages proper and the Polynesian family (see List 
5.1). Such a bipartite division of Austronesian has remained popular to 
this day, sometimes under the guise of a Western (-- Indonesian)/Eastern 
(7 Oceanic) terminology (see Pawley 1974). 

The most influential Austronesianist of the first half of the twentieth 
century was the German Otto Dempwolff, who, in a three-volume work 
published from 1934 to 1938, proposed more than 2,000 lexical items for 
Austronesian. Dempwolff also carefully worked out many of the major 
sound correspondences exhibited by the Austronesian family and for the 


List 5.1. The Austronesian Family (Schmidt 1926) 


AUSTRONESIAN: 
I INDONESIAN: 
A WEST 
B EAST 
II OCEANIC: 
A MELANESIAN: 
1 MELANESIAN PROPER 
2 MICRONESIAN (excluding 
Chamorro and Palauan) 
B "TRANSITIONAL" MELANESIAN 
C POLYNESIAN 


164 5 ОСЕАМТА 


first time stated precisely the criteria defining the Oceanic subgroup. But 
Dempwolff was not primarily interested in questions of classification and 
never suggested a complete subgrouping for the whole family. His views 
were probably not significantly different from Schmidt's. 

The precise position of the border between Oceanic and Indonesian was 
clarified only in the 1950's through the work of Wilhelm Milke and George 
Grace, and is presently drawn at roughly 136 degrees east longitude. 

One of the weaknesses of this early classificatory work on Austronesian 
was that, although the evidence that had been developed in support of the 
Oceanic branch was substantial, evidence demonstrating the validity of In- 
donesian was all but nonexistent. During the past two decades the unitary 
nature of Indonesian has been increasingly called into question by a num- 
ber of scholars, as we see in List 5.2. 

Dyen (1965) published a lexicostatistical classification indicating that the 
Austronesian family is composed of some 40 different primary branches. At 
the same time he questioned the validity of the Oceanic subgroup on the 
grounds that his data located the area of greatest internal diversity in Oceanic 
territory. In this confrontation between Dyen's quantitative approach and 
the traditional qualitative methods of comparative linguistics, the latter 
have carried the day, and Dyen's proposals currently have few adherents. 


5.1.2 Present Status of Austronesian Classification 


Despite the renewed interest in Austronesian classification over the last 
decade or so, and the many significant contributions that have appeared 
during that time, no widely accepted subgrouping of the family as a whole 
has yet emerged. The reasons are not hard to pinpoint. The Austronesian 
family presents three of the chief ingredients for taxonomic difficulties. 
First, it consists of a large number of very similar languages. Second, with a 
few exceptions (e.g. Malay-Indonesian, Tagalog, Cebuano), Austronesian 
tongues are spoken by small groups in remote areas of the world and are 
therefore usually considered of no interest to anyone but the dedicated 
scholar or missionary. The literature dealing with them has been both in- 
complete and uneven in quality, but happily this situation has been amelio- 
rated somewhat in recent years by a wealth of new publications. Third, 
most Austronesianists have chosen to investigate only a small portion of 
the vast Austronesian domain. Relatively few scholars have been so au- 
dacious as to attempt to handle the entire family, yet without such a global 
perspective the chances of discovering a defensible subgrouping for Aus- 
tronesian are slim. Recent work by Blust is encouraging in this regard. 

What, one might ask, is presently known of the internal relationships of 
the Austronesian family? First, Austronesian is itself a well-defined genetic 


List 5.2. Proposed Subgroupings of the Austronesian Family 


Haudricourt 1962 


AUSTRONESIAN: 
I NORTHERN (= FORMOSAN) 
II WESTERN (= INDONESIAN) 
Ш EASTERN (= ОСЕАМІС): 
A MICRONESIAN 
B MELANESIAN 
C POLYNESIAN 


Blust 1978 


AUSTRONESIAN: 
I ATAYALIC(- FORMOSAN) 
II TSOUIC (= FORMOSAN) 
III PAIWANIC (- FORMOSAN) 
IV MALAYO-POLYNESIAN: 
A WESTERN 
B CENTRAL-EASTERN: 
1 CENTRAL 
2 EASTERN: 
a SOUTH 
HALMAHERA- 
NORTHWEST 
NEW GUINEA 
b OCEANIC 


Benedict 1983 


AUSTRONESIAN: 
I ATAYALIC (= FORMOSAN) 
II TSOUIC-MALAYO- 
POLYNESIAN: 
A TSOUIC (= FORMOSAN) 
B PAIWANIC-MALAYO- 
POLYNESIAN: 
1 PAIWANIC 
(= FORMOSAN) 
2 MALAYO- 
POLYNESIAN 


Dahl 1973 


AUSTRONESIAN: 
I FORMOSAN 
II NON-FORMOSAN: 
А WESTERN (= INDONESIAN) 
B EASTERN (= ОСЕАМІС): 
1 MELANESIAN 
2 POLYNESIAN 


Reid 1982 


AUSTRONESIAN: 
I ATAYALIC(= FORMOSAN) 
П TSOUIC (= FORMOSAN) 
III PAIWANIC (= FORMOSAN) 
IV SOUTH MINDANAO 
(= PHILIPPINE) 
V AMI-EXIRA-FORMOSAN: 
А Ami 
B EXTRA-FORMOSAN: 
1 OUTER PHILIPPINES: 
a NORTHERN 
PHILIPPINES 
b SOUTHERN 
PHILIPPINES 
2 MALAYO-POLYNESIAN: 
a WESTERN: 
i MESO-PHILIPPINE 
ii CELEBES 
iii BORNEO 
iv SAMA-BAJAW 
v SUNDIC 
b CENTRAL-EASTERN 
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unit. With the exception of Maisin (spoken оп the southeast coast of main- 
land Papua New Guinea) and the languages of the Reef Islands and Santa 
Cruz, there are no disputes on which languages belong to the family. The 
exceptional languages just mentioned have been variously classified as 
Papuan-influenced Austronesian languages or Austronesian-influenced 
Papuan languages. The position followed here is that Maisin is basically an 
Austronesian tongue, probably belonging to the Oceanic subgroup. The 
Reef Islands-Santa Cruz languages are considered Papuan languages, 
forming part of the East Papuan branch of Indo-Pacific. 

Second, within Austronesian there appears to be a growing consensus 
that the Formosan languages occupy a special position, as all the subgroup- 
ings in List 5.2 indicate. It now seems likely that these languages constitute 
either one or several primary branches of Austronesian. Furthermore, work 
by Blust indicates that all extra-Formosan Austronesian languages consti- 
tute a single primary branch, which he calls Malayo-Polynesian (M-P). 
Within M-P the Oceanic subgroup appears to be a valid genetic node, as do 
innumerable small families (e.g. Polynesian, Micronesian, Loyalty Islands, 
Bashiic, South Mindanao, Piru Bay). How these numerous subfamilies fit 
together within M-P is a primary subject of current research, and con- 
tinued refinement (or even revision) of the picture is to be expected in the 
coming years. 

The Austronesian classification presented here is based on the sources 
cited in the bibliography at the end of this section, as well as on the valu- 
able advice of the dozen or so scholars who were kind enough to share 
their expertise with me and whose assistance I have already acknowledged 
at the beginning of this volume. The overall framework for Austronesian 
follows Blust (1978)—see List 5.2—in positing four primary branches, 
three Formosan groups (Atayalic, Tsouic, Paiwanic), and the remainder of 
the family (M-P). Within M-P Blust groups the well-defined Oceanic node 
with a group of languages spoken on the northwest coast of New Guinea 
and the southern portion of Halmahera to form a subgroup called Eastern 
М-Р. This in turn he subgroups with languages spoken in the Maluku re- 
gion of Indonesia to form the node Central-Eastern М-Р, which constitutes 
one of two primary branches of M-P. The other, Western M-P, contains the 
remainder of the Austronesian languages. These embrace most Indonesian 
languages; all Philippine languages; and a few scattered outliers on the 
Malay Peninsula (Malay, Jakun), in Vietnam and Kampuchea and on Hainan 
Island (the Chamic family), on Madagascar (Malagasy), and in Micronesia 
(Chamorro, Palauan, and possibly Yapese). 

The subgrouping within Western M-P is perhaps the least studied and 
least understood area of Austronesian classification. Although it has been 


List 5.3. The Austronesian Family (Blust 1978, Pawley 1972, апа others) 


AUSTRONESIAN: 
I ATAYALIC 
II TSOUIC 
III PAIWANIC 
IV MALAYO-POLYNESIAN: 
A WESTERN: 
1 Chamorro 
2 Palauan 
3 Yapese 
4 NORTHERN PHILIPPINES 
5 SOUTHERN PHILIPPINES 
6 MESO-PHILIPPINE 
7 SOUTH MINDANAO 
8 CELEBES 
9 BORNEO 
10 SAMA-BAJAW 
11 SUNDIC 
B CENTRAL-EASTERN: 
1 CENTRAL: 
a CENTRAL MALUKU 
b SOUTH-EAST MALUKU 
2 EASTERN: 
a SOUTH HALMAHERA-NORTHWEST NEW GUINEA 
b OCEANIC: 
i Maisin 
ii SARMI-YOTAFA 
iii SIASSI 
iv MARKHAM 
v MILNE BAY-CENTRAL PROVINCE 
vi KIMBE 
vii NEW BRITAIN 
viii NEW IRELAND-TOLAI 
ix ADMIRALTY ISLANDS 
x BOUGAINVILLE 
xi CHOISEUL 
xii NEW GEORGIA 
xiii SANTA ISABEL 
xiv SANTA CRUZ 
xv SOUTHERN NEW HEBRIDES 
xvi NEW CALEDONIA 
xvii LOYALTY ISLANDS 
хуш REMOTE OCEANIC: 
a MICRONESIAN 
B SOUTHEAST SOLOMONS 
y CENTRAL AND NORTHERN NEW HEBRIDES 
6 CENTRAL PACIFIC: 
I ROTUMAN-FIIIAN 
II POLYNESIAN 
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customary to include а Philippine group ав опе component of Western М-Р, 
Reid (1981) guestions the validity of such a node. Moreover, it has long 
been realized that there are “Philippine-like” languages in northern Borneo 
and Celebes, so the southern border of a Philippine node is far from clear. 
In List 5.3 groups 4—7 of Western M-P (i.e. Northern Philippines, Southern 
Philippines, Meso-Philippine, South Mindanao) constitute the traditional 
Philippine group. Of the four non-Philippine groups within Western M-P, 
only Sama-Bajaw seems secure as a valid genetic group (at some level). The 
geographic nature of the other three groups (Celebes, Borneo, Sundic) is 
apparent in their names; their validity as well-defined nodes is dubious. It 
will surely turn out that the 11 subgroups of Western M-P shown in List 5.3 
are not all equidistant from one another. However, until someone proposes 
an overall subgrouping for Western M-P, this provisional classification will 
have to suffice. 

The internal subgroupings of the four Philippine groups and Sama- 
Bajaw follow, in the main, a classification compiled by Lawrence Reid on 
the basis of a wide range of published and unpublished materials. I have 
modified his classification slightly by condensing somewhat the number of 
distinct languages in South-Central Cordilleran, Central Philippine, and 
Sama-Bajaw. A few other modifications reflect McFarland (1980) and Zorc 
(1982). For the classification of the other Western M-P groups I have relied 
heavily, but not exclusively, on Wurm and Hattori (1983). 

The Oceanic subgrouping is based broadly on the work of Andrew 
Pawley, with a few modifications introduced on the basis of Wurm and Hat- 
tori (1981). The individual Oceanic subgroups reflect a variety of sources, 
which аге listed in the bibliography to this section. Finally, Yapese, which 
has been classified here as an ungrouped language within Western M-P 
(like Chamorro and Palauan), may in fact turn out to be a member of 
Oceanic; its position is at present unclear. 

The general structure of the Austronesian family adopted in this volume 
is shown in List 5.3. 
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5.2 INDO-PACIFIC 


Although there are more than 700 distinct Indo-Pacific (or Papuan) lan- 
guages spoken on New Guinea and neighboring islands, they count as 
speakers fewer than three million people. The Papuan language with the 
most speakers is Enga, which is spoken in the Western Highlands of Papua 
New Guinea by about 165,000 people. To the west of New Guinea, Papuan 
languages are found on Halmahera, Timor, Alor, and Pantar; east of New 
Guinea they are spoken on New Britain, New Ireland, Bougainville, the 
Solomons, the Reef Islands, and Santa Cruz. In addition to the Papuan lan- 
guages spoken on islands surrounding New Guinea, it has been suggested 
that the indigenous languages of the Andaman Islands (in the Andaman 
Sea just west of lower Thailand) and the extinct languages of Tasmania (off 
the southeast coast of Australia) are distantly related to this family. The geo- 
graphic distribution of the Indo-Pacific family is illustrated in Map 5.2. 
None of the Indo-Pacific languages can truly be called "famous." At best a 
few have attracted some attention in the anthropological literature. 


5.2.1 History of Indo-Pacific Classification 


The languages of New Guinea remained until quite recently among the 
least studied of any in the world. Despite the fact that approximately 1,000 
distinct languages (roughly 20 percent of the world total) are spoken in 
New Guinea, making it the area of the world with the greatest concentra- 
tion of languages (and greatest diversity as well), these languages were vir- 
tually ignored by the scholarly community until the 1950's. In 1950 only a 
handful of small language families had been recognized in various parts of 
the island, and for most of New Guinea little was known at all. The two 
decades from 1955 to 1975 witnessed a veritable explosion of research, and 
today our knowledge of the classification of these languages has reached a 
point that seemed unattainable only 30 years ago. With minor exceptions, 
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Map 5.2. The Indo-Pacific family 


all of New Guinea is now known linguistically, and a general understand- 
ing of the genetic relationships of these languages has been reached, 
though, as we shall see, many problems remain. 

As I noted in the preceding section (5.1), although the Malayo-Polyne- 
sian (M-P) family was identified during the eighteenth century, the affinity 
of the so-called Melanesian languages (which fall geographically between 
the Malayan and Polynesian families) with M-P came to be accepted only 
during the second half of the nineteenth century. The chief impediment to 
including the Melanesian languages in the M-P family appears to have been 
the fact that speakers of Melanesian languages have dark skins, whereas 
the M-P family, as originally conceived, contained only light-skinned Indo- 
nesians and Polynesians. The eventual inclusion of the Melanesian lan- 
guages in M-P (which, thus expanded, came to be called Austronesian 
[AN]) raised the question of whether al! Melanesian languages were in fact 
AN, or whether there might not be some non-AN Melanesian languages as 
well. Several early claims to have discovered such languages turned out to 
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Бе false, the languages іп question having proved to Бе АМ, and it was not 
until 1892 that Sidney Ray demonstrated the existence of non-AN lan- 
guages in southeastern New Guinea. Ray proposed the name Papuan (a 
term already in use) for these non-AN languages, restricting Melanesian to 
the AN languages of Melanesia. At the start of the twentieth century, 
Wilhelm Schmidt documented Papuan languages on the northern part of 
the island as well. 

There are several reasons why Papuan languages were identified so 
much later than the AN family. Most important, no doubt, was the fact that 
Papuan languages are concentrated on a single island (New Guinea), with 
only a few scattered outliers on surrounding islands. Moreover, on New 
Guinea itself a large portion of the coastline was inhabited by AN speakers, 
who, with their boats, could visit the first explorers without the latter even 
having to touch shore. The Papuans, on the other hand, were largely ге- 
stricted to the interior of the island. The first wordlist of a New Guinea lan- 
guage was collected in 1616, almost a century after the first European set 
foot on New Guinea (1526). This language was, however, an AN language, 
as were all the languages on which information became available during the 
following two centuries. It was not until 1822 that a Papuan wordlist was 
finally collected, some four decades after the M-P family had been firmly 
established. During the nineteenth century information on Papuan lan- 
guages began to trickle in, but was still completely overshadowed by AN 
research. Finally, it is likely that the great diversity of Papuan languages, in 
the face of AN homogeneity, contributed to their neglect. In any event, the 
distinction between AN and Papuan languages did not become clear until 
Ray (1893). 

During the first half of the twentieth century, Papuan languages were 
discovered elsewhere on New Guinea and surrounding islands. In Western 
New Guinea they were documented primarily by Dutch missionaries. Table 
5.1 lists those Papuan families that had been identified by the mid-1950's; 
references may be found in Wurm (1975, 1982). 

With the exception of the Australian scholar Arthur Capell, who was a 
pioneer in all three areas of Oceanic research (Austronesian, Australian, 
and Papuan), the academic world showed scant interest in the myriad di- 
verse Papuan languages during the first half of this century, and by 1950 the 
common belief was that the New Guinea area contained innumerable small 
families (only a few of which had been identified), that displayed no rela- 
tionship either among themselves or to languages outside New Guinea. 
Four separate research projects radically altered this picture during the 
19505 апа 1960's: (1) the work of H. K. J. Cowan (1957a,b,c, 1959, 1963); (2) 
the studies of Joseph H. Greenberg (1958, 1960, 1971); (8) numerous pub- 
lications of the New Guinea branch of the Summer Institute of Linguistics; 
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Table 5.1. Early Identification of Рариап Groups 


Year Recognized by Group (current name) 

1872 P.J. B. С. Robidé vanderAa North Halmahera 

1900 W. Schmidt West Papuan (= West New Guinea + North 
Halmahera) 

1905 W. Schmidt Torricelli (Schmidt connected two languages: 
Valman and Monumbo) 

1907 S. Ray Eleman 

1908 W. Schmidt Central Solomons 

1912 G. Frederici Sko 

1914 A. R. Radcliffe-Brown Andaman Islands 

1915 H. van der Veen North Halmahera 

1918 J. H. P. Murray, S. Ray Marind (partial) 

1919 S. Ray Rai Coast 

1922 F. Kirschbaum Ndu 

1941 K. H. Thomas Sko 

1944 A. Capell Timor 

1949 P. Drabbe Kolopom 

1950 J. H. M. C. Boelaars Marind (erpanded) 

19505 Р. Drabbe Awyu, Awyu-Dumut, Bulaka River 

1951 K. Laumann Nor-Pondo 

1951 5. Wurm Kiwaian 

1951 A. Capell Madang 

1956 D. Laycock Upper Sepik 


1957 Н.К.). Cowan West Papuan, North Papuan 


and (4) the research of Stephen A. Wurm and his colleagues at the Aus- 
tralian National University, summarized in Wurm (1975, 1982). 

H. K. J. Cowan was а Dutch civil servant and self-taught linguist who 
devoted considerable time and energy to the languages of Netherlands 
New Guinea, where he served. He was especially interested in their classi- 
fication and was among the first to ргорове the existence of widespread ge- 
netic relationships in the New Guinea area. In 1957 Cowan postulated both 
a West Papuan family (uniting certain languages of Northwest New Guinea 
with a group of languages spoken in North Halmahera) and a North Pa- 
puan family (comprising various languages from Northern New Guinea). 
Cowan expanded the West Papuan family іп the late 19505 to include addi- 
tional New Guinea languages, and in the early 1960's he proposed the inclu- 
sion of the languages of Timor Island as well. 

After finishing his work on the classification of African languages in the 
mid-1950's (see Chap. 3), the American linguist Joseph Greenberg turned 
his attention to the two areas of the world where genetic classification was 
least advanced, South America and the New Guinea area. His conclusions 
regarding South America are discussed in section 6.3; we shall now exam- 
ine those concerning New Guinea. 
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Employing essentially the same methods that had proved successful іп 
classifying African languages (i.e. the compilation of lists of basic vocabu- 
lary for a wide range of languages, with whatever grammatical information 
was available), Greenberg concluded that the non-AN languages of New 
Guinea and surrounding islands were all (or almost all) genetically related. 
He reported his findings in a paper delivered to the Association for Asian 
Studies іп 1960. During the 19605 Greenberg continued to expand his data 
base, and in 1971 he finally published the evidence he had accumulated that 


the bulk of non-Austronesian languages of Oceania, from the Andaman Islands on 
the west in the Bay of Bengal to Tasmania in the southeast, forms a single group of 
genetically related languages for which the name Indo-Pacific is proposed. Тһе 
major exception to this generalization is constituted by the indigenous languages of 
Australia, nearly all of which are generally accepted as related to each other. This 
Australian stock does not seem to align genetically with Indo-Pacific and is ех- 
cluded from it, at least tentatively. With regard to the remaining languages, mem- 
bership in the overall family can be asserted with varying degrees of confidence 
(Greenberg 1971: 807). 


Greenberg proposed 14 subgroups for the Indo-Pacific phylum; the 
seven on New Guinea were given geographic designations (Central, North, 
South, Southwest, West, East, Northeast), and the seven outside New 
Guinea were named after the island(s) on which they were spoken (An- 
daman Islands, Tasmanian, Halmahera, Timor-Alor, Bougainville, New 
Britain, Central Melanesian [= Central Solomons + Santa Cruz]). As a 
first approximation to the internal relationships, Greenberg suggested the 
subgrouping shown in List 5.4. Greenberg’s West Papuan group corre- 
sponded closely with Cowan's, though he included a few additional lan- 
guages. Greenberg extended Cowan’s North Papuan family eastward to in- 
clude languages from Australian New Guinea (now Papua New Guinea). 


List 5.4. The Indo-Pacific Phylum (Greenberg 1971) 


IV WEST PAPUAN: 
A WEST NEW GUINEA 
B NORTH HALMAHERA 
C TIMOR-ALOR 
V EAST NEW GUINEA 
VI NORTHEAST NEW GUINEA 
ҮП PACIFIC: 
A BOUGAINVILLE 
B NEW BRITAIN 
C CENTRAL MELANESIAN: 
1 CENTRAL SOLOMONS 
2 SANTA CRUZ 


INDO-PACIFIC: 
І ANDAMAN ISLANDS 
H tTASMANIAN 
Ш NUCLEAR NEW GUINEA: 
A CENTRAL NEW GUINEA: 
1 KAPAUKU-BALIEM 
2 HIGHLANDS 
3 HUON 
B NORTH NEW GUINEA 
C SOUTH NEW GUINEA 
D SOUTHWEST NEW 
GUINEA 
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As evidence for the Indo-Pacific hypothesis, Greenberg proposed 84 ety- 
mologies linking the 14 subgroups. In addition, he offered sets of ety- 
mologies for those subgroups whose existence was not already established 
(e.g. North New Guinea). А limited amount of grammatical information, 
primarily concerning the distribution of pronominal etyma, was adduced 
as corroboration of the lexical evidence. 

In addition to the individual efforts of Cowan and Greenberg, two 
organizations have helped clarify the classification of Papuan languages. 
Founded in the mid-1950's, the New Guinea branch of the Summer Institute 
of Linguistics (SIL, a missionary organization that produces literacy materi- 
als and Bible translations) has published numerous grammars, diction- 
aries, and other specialized studies on the languages of New Guinea. Some 
SIL members (notably Alan and Phyllis Healey, Kenneth McElhanon, Kari 
Franklin, Bruce Hooley, Oren Claassen, Richard Lloyd) have also made sig- 
nificant contributions to the classification of Papuan languages. 

The other organization actively involved in investigating New Guinea 
languages is the Australian National University, which has sponsored a 
research program under the direction of Stephen Wurm since the mid- 
1950's. The result has been extensive new field research in previously un- 
known regions by numerous scholars, of whom Wurm, Donald Laycock, 
Clemens Voorhoeve, Thomas Dutton, and John Z'graggen have been most ` 

, prominent. The classification of Papuan languages has been a prime con- 
cern of Wurm and his colleagues from the beginning, and one may discern 
in their work two phases. During the initial phase emphasis was placed on 
documenting all of the existing families; it became increasingly clear during 
the 1950's and early 1960's that there were in fact a number of genetic groups 
in New Guinea that were much larger than previously suspected. Further- 
more, the earlier notion that Papuan languages were spoken only by small 
numbers of people had to be revised with the discovery in the Highlands of 
languages with more than 100,000 speakers. 

Eventually, as small families grew into increasingly larger stocks, these 
stocks began to "bump into" each other, or rather, it was seen that a case 
could be made for classifying some languages as members of more than one 
stock. The implication was clearly that these stocks must be components of 
an even broader genetic unit. The second phase of classificatory work be- 
gan with this recognition. Wurm took the first steps toward such a consoli- 
dation in the early 1960's, and in 1965 Carl and Florence Voegelin followed 
him in setting up a large-scale grouping that became known as the Central 
New Guinea Macro-Phylum. This broad grouping continued to expand 
until, in 1970, it had grown to such an extent that a new name was felt 
appropriate; in that year McElhanon and Voorhoeve rechristened it the 
Trans-New Guinea (TNG) phylum. 





List 5.5. Тһе Trans-New Guinea Phylum (Wurm 1982) 


TRANS-NEW GUINEA: 
I MAIN SECTION: 
A CENTRAL AND WESTERN: 
1 FINISTERRE-HUON 
2 EAST NEW GUINEA 
HIGHLANDS 
3 CENTRAL AND SOUTH 


NEW GUINEA-KUTUBUAN 


4 ANGAN 

5 GOGODALA-SUKI 
6 MARIND 

7 KAYAGAR 

8 SENTANI 

9 DANI-KWERBA 
10 WISSEL LAKES- 

KEMANDOGA 
11 MAIRASI-TANAH MERAH 
12 WEST BOMBERAI 
B EASTERN: 

1 BINANDEREAN 

2 GOILALAN 

3 KOIARIAN 

4 KWALEAN 


[B EASTERN] 
5 MANUBARAN 
6 YAREBAN 
7 MAILUAN 
8 DAGAN 
П MADANG-ADELBERT 
RANGE 
III TEBERAN-PAWAIAN 
IV TURAMA-KIKORIAN 
V INLAND GULF 
VI ELEMAN 
VII TRANS-FLY-BULAKA 
RIVER 
ҮШ МЕК 
ІХ SENAGI 
Х PAUWASI 
ХІ МОЕТНЕЕМ 
XII МІМВОКАМ 
ХШ KAURE 
ХІУ SOUTH BIRD'S HEAD 
KV KOLOPOM 
KVI TIMOR-ALOR-PANTAR 





List 5.6. Рариап Language Phyla Other Than the Trans-New Guinea Phylum 


(Wurm 1982) 


WEST PAPUAN: 
I Amberbaken 
H BIRD’S HEAD 
III BORAIL-HATTAM 
IV NORTH HALMAHERA 


EAST BIRD'S HEAD 
GEELVINK BAY 

SKO 
KWOMTARI-BAIBAI 
ARAI 
AMTO-MUSIAN 


TORRICELLI: 
I Urim 
II WEST WAPEI 
III WAPEI-PALEI 





[TORRICELLI] 


IV MAIMAI 


V KOMBIO-ARAPESH 


VI MARIENBERG 
VIL MONUMBO 


SEPIK-RAMU: 
I Gapun 
П SEPIK 
M LEONARD SCHULTZE 
IV NOR-PONDO 
V RAMU 


EAST PAPUAN: 
I YELE-SOLOMONS- 
NEW BRITAIN 
П BOUGAINVILLE 
Ш REEF ISLANDS- 
SANTA CRUZ 
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Тһе ТМС phylum itself gradually increased іп size until around 1975, 
when it reached its present proportions. Although it is still substantially 
less inclusive than Greenberg's Indo-Pacific phylum, it nonetheless con- 
tains two-thirds of all Papuan languages spoken in New Guinea and over 80 
percent of all speakers. List 5.5 shows the TNG phylum as currently con- 
ceived. In addition to the TNG phylum, Wurm and colleagues recognize 10 
other phyla and eight isolated languages, all of which are considered inde- 
pendent on the basis of present evidence. The eight isolates are Warenbori, 
Талгар, Yuri, Busa, Nagatman, Porome, Pauwi, and Massep; the 10 phyla 
are shown іп List 5.6. Sepik-Ramu contains about 100 languages, Torricelli 
about 50, and West Papuan and East Papuan roughly 25 each. The other 
families contain fewer than 10 languages each. 

Although attention has been focused increasingly on higher-level group- 
ings since around 1960, considerable progress on lower levels of classifica- 
tion has also been achieved, and many new families have been identified 
during the past two decades. Table 5.2 lists some of the more important dis- 


coveries; references may be found in Wurm (1975, 1982). 


Table 5.2. Recent Identification of Papuan Groups 


Year Recognized by Group 

1960 65. Wurm East New Guinea Highlands 

1962 A. Capell Dani-Kwerba, Binanderean, Turama- 
Kikorian and others 

1962 W. Davenport Reef Islands-Santa Cruz 

1962 А. Healey Ok, Awin-Pa 

1964 W. Steinkrauss, A. Pence Goilalan 

1964 R. Loving, J. Bass Senagi, Kwomtari-Baibai, Amto-Musian 

1965 С. Allen, C. Hurd Bougainville 

1965 D. Laycock Sepik 

1967 М. Bromley Mek (formerly Goliath) 

1968 Ј. Lloyd, A. Healey Angan 

1968 РР. Laycock Torricelli 

1968 С. Voorhoeve Central and South New Guinea 

1968 K. McElhanon Huon 

1968 W. Dye, P. Townsend, W. Townsend Sepik Hill 

1969 Т. Dutton Eastern Trans-New Guinea (formerly 
Southeast New Guinea) 

1969 K. Franklin Inland Gulf 

1970 O. Claassen, K. McElhanon Rai Coast 

1970 K. McElhanon Finisterre 

1970 5. Wurm Trans-Fly 

1970 K. McElhanon, C. Voorhoeve Trans-New Guinea 

1971  J.Z'graggen Ramu 

1971 Ј. Greenberg East Рариап (= Pacific) 

1973 Р. Laycock Sepik-Ramu 

1973 Е S. Watuseke, J. C. Апсеаих Timor-Alor-Pantar 

1973 К. Franklin, C. Voorhoeve Kutubuan 
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5.2.2 Present Status of Indo-Pacific Classification 


Reactions to Greenberg's Indo-Pacific hypothesis have been mixed. Blust 
(1978) found it “bold and brilliant" (p. 475), with the grammatical evidence 
offered in support of Indo-Pacific showing Greenberg "at his best, display- 
ing the patient, encyclopaedic scholarship, unifying insight, and method- 
ological soundness that combine to assure his position as one of the great 
linguists of our time. . . . In sum, Greenberg’s argument, right or wrong, 
seems destined profoundly to affect all future thinking about linguistic re- 
lationships in this part of the world" (pp. 476—77); and in reexamining 
Cowan's West Papuan phylum, Capell (1975: 685) found "'the comparisons 
presented by Greenberg . . . to be valid and fully acceptable"; but others 
disagree. Probably the most controversial aspect of Greenberg's hypothesis 
has been the inclusion of the (extinct) Tasmanian languages in the Indo- 
Pacific phylum. Wurm (1975, 1982) found Greenberg's evidence unconvinc- 
ing, and Crowley and Dixon (1981: 420) consider it "outrageous." 

It should be borne in mind that much of the problem of classifying the 
Tasmanian languages derives from the fact that the Tasmanian people were 
exterminated before much of anything had been learned of their languages. 
The little material that was gathered is of very poor quality. Though one 
might anticipate, on geographic grounds, a connection between Tasmanian 
and Australian languages, evidence for it has been elusive at best. Recently 
Blake (1981: 63) has suggested 10 possible cognates connecting an Aus- 
tralian language of western Victoria and one Tasmanian dialect. Green- 
berg, by contrast, proposed 20 cognates linking the Tasmanian languages 
with other groups of Indo-Pacific. 

Franklin (1973a) gives a negative appraisal of the Indo-Pacific hypothesis 
on the basis of Greenberg's treatment of the languages of the Gulf District of 
Papua New Guinea. However, since all the Papuan languages of the Gulf 
District are now accepted as genetically related (as members of either TNG 
or Indo-Pacific), Franklin's article deals in effect with questions of sub- 
grouping and not with the ultimate validity of Indo-Pacific. The claim that 
errors in low-level subgrouping call into question the validity of higher- 
level nodes is obviously false; innumerable controversies over Austronesian 
subgrouping have never affected the validity of Austronesian itself. 

There have been to date only two full-scale classifications of Papuan lan- 
guages: Greenberg (1971), using multilateral comparison, and Wurm (1975, 
1982), based in large measure on lexicostatistical analysis. The overall struc- 
ture of both classifications has been illustrated in Lists 5.4-5.6 above. But 
since the two classifications employ completely different terminology, it is 
not readily apparent from these lists how they are alike and how they dif- 
fer. In 1981 I compared these two classifications in order to determine their 
similarities and differences, and I will now briefly summarize my findings. 
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Ав has already been mentioned, опе of the primary differences is that 
Greenberg's classification is considerably more inclusive than Wurm's. 
Wurm's TNG phylum excludes (1) the languages of the Andaman Islands, 
(2) Tasmanian, (3) the Papuan languages found on islands to the east of 
New Guinea (Greenberg's Pacific group, Wurm’s East Papuan), (4) much of 
Greenberg's North New Guinea group, and (5) part of Greenberg's West Pa- 
puan family. Wurm does not simply dismiss the resemblances between 
these groups and the ТМС phylum; in fact he (1982: 255) concedes that lexi- 
cal agreements between the Andaman languages and the West Papuan 
phylum-Timor-Alor-Pantar stock "are quite striking and amount to virtual 
formal identity . . . in a number of instances." Nevertheless Wurm prefers 
to attribute the resemblances to "widespread old substrata" rather than to 
common origin. He also finds TNG "substratum influence" responsible for 
resemblances among the East Papuan languages, hypothesizing that "at 
least some East Papuan Phylum languages were driven out of the New 
Guinea mainland by speakers of Trans-New Guinea Phylum languages" 
(Wurm 1975: 798, 799). 

Wurm admits only a portion of Cowan's (and Greenberg's) West Papuan 
group into the TNG phylum, namely the South Bird's Head, West Bom- 
berai, and Timor-Alor families. The remainder of Cowan's West Papuan 
phylum is treated as three independent phyla: Geelvink Bay, East Bird's 
Head, and (a diminished) West Papuan that now includes only those lan- 
guages on the extreme northwest tip of New Guinea and in Northern 
Halmahera. Similarities between the (reduced) West Papuan and ТМС phyla 
are again attributed to substratum influence (see Wurm 1982: 204). 

With the exception of the small Sko family, all of Cowan's North Papuan 
phylum has now been incorporated in the TNG phylum. "In spite of the 
quite sizable Trans- New Guinea Phylum lexical elements in [Sko], which 
includes verbs and pronouns," Wurm (1982: 72) excludes it from the TNG 
phylum because it fails to exhibit the "typological and structural" criteria 
characteristic of that phylum. Moving in a different direction, Greenberg 
expanded Cowan's North Papuan family to include the Sepik-Ramu and 
Torricelli families. He calls this expanded stock North New Guinea. We may 
note that Greenberg's North New Guinea group corresponds to four ge- 
netically independent families in the Wurm classification: Sepik-Ramu, 
Torricelli, Sko, and TNG. 

Although the differences between the two classifications are by no means 
inconsequential, there are also important similarities: 

1. Both classifications concur on the genetic unity of the Papuan lan- 
guages spoken on Bougainville, New Britain, the Solomon Islands, and 
Santa Cruz. Greenberg calls this group Pacific; Wurm, East Papuan. Fur- 
thermore, the subgroupings proposed are practically identical, the sole dif- 
ference being that Greenberg subgroups the Solomon group with the Santa 
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Cruz languages, whereas Wurm subgroups them with the New Britain 
languages. 

2. Both classifications concur on a genetic unit that Greenberg calls 
Northeast New Guinea and Wurm calls Madang-Adelbert Range. 

3. There is also very close agreement on the genetic unity of the lan- 
guages in Greenberg's East New Guinea group, which corresponds almost 
exactly with the Eastern branch of the Main Section of the TNG phylum. 

4. Of Greenberg's seven mainland groups, five (Central, Southwest, 
South, East, and Northeast New Guinea) correspond to constituents of 
Wurm's ТМС phylum. In addition, portions of Greenberg’s West New 
Guinea and North New Guinea groups are included in the TNG phylum. 


List 5.7. The Indo-Pacific Phylum (Greenberg 1971, Wurm 1982) 


INDO-PACIFIC: ІШ TRANS-NEW GUINEA] 
I TTASMANIAN B MADANG- 
II ANDAMAN ISLANDS ADELBERT RANGE 
III TRANS-NEW GUINEA: C TEBERAN-PAWAIAN 
A MAIN SECTION: D TURAMA-KIKORIAN 
1 CENTRAL AND WESTERN: E INLAND GULF 













a FINISTERRE-HUON F ELEMAN 

b EAST NEW GUINEA С TRANS-FLY- 
HIGHLANDS BULAKA RIVER 

c CENTRAL AND SOUTH H MEK 
NEW GUINEA- I SENAGI 
KUTUBUAN J PAUWASI 

d ANGAN K NORTHERN 

e GOGODALA-SUKI L NIMBORAN 

f MARIND M KAURE 

g KAYAGAR М SOUTH BIRD'S HEAD 

h SENTANI O KOLOPOM 


i DANI-KWERBA 


P TIMOR-ALOR 














j WISSEL LAKES- IV WEST PAPUAN 


KEMANDOGA V EAST BIRD'S HEAD 

k MAIRASI-TANAH VI GEELVINKBAY 
MERAH VII SKO 

1 WEST BOMBERAI VIII KWOMTARI-BAIBAI 

2 EASTERN: IX ARAI 

a BINANDEREAN X AMTO-MUSIAN 

b GOILALAN XI TORRICELLI 

c KOIARIAN XII SEPIK-RAMU 

d KWALEAN XIII EAST PAPUAN 

e MANUBARAN 

f YAREBAN 

g MAILUAN 


h DAGAN 
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5. The most important similarity between the two classifications is simply 
that the vast majority of Papuan languages are now recognized ав forming а 
single genetic unit. Whether that unit encompasses two-thirds of the Pa- 
puan languages (TNG) or nearly all of them (Indo-Pacific), the long-lived 
myth of New Guinea as a land beyond the pale of genetic classification has 
finally been laid to rest. 

Whether the widespread similarities among Papuan languages of Oce- 
ania will prove to be the residue of an ancient genetic unity, as Greenberg 
claims, or simply the result of borrowing (i.e. substratum influence), as 
Wurm suggests, it is too early to tell. So much important work has been 
done in recent years that it is easy to lose sight of the fact that we are still in 
the initial stages of the investigation of Papuan classification, and as more 
material accumulates, our understanding of the relationships among these 
diverse languages is likely to continue to undergo revision. The classifica- 
tion adopted in this volume, outlined in List 5.7, represents a compromise 
between the two classifications discussed above. At the highest level Indo- 
Pacific is used to designate a phylum consisting of all the Papuan lan- 
guages, but the internal subgrouping within Indo-Pacific follows Wurm 
(1982). The classification of the languages of the Andaman Islands follows 
Bradley (1983). 


Anceaux, J. C. 1958. "Languages of the Bomberai Peninsula," Nieuw-Guinea Studien 
2: 109-21. 

Blake, Barry J. 1981. Australian Aboriginal Languages. Sydney. 

Blust, Robert. 1978. Review of Current Trends in Linguistics, ҮШ: Linguistics in 
Oceania, ed. Thomas A. Sebeok, Language 54: 467—80. 

Bradley, David. 1983. "Map of Andaman and Nicobar Islands," in S. A. Wurm and 
Shiró Hattori, eds., Language Atlas of the Pacific Area 2, Map 37. Stuttgart. 

Capell, Arthur. 1944. “Peoples and Languages of Timor," Oceania 14: 330-37. 

. 1975. "The "West Papuan Phylum': General, and Timor and Areas Further 
West," in PLNGLS. 

Cowan, H. K. J. 1957a. "A Large Papuan Language Phylum in West New Guinea," 
Oceania 28: 159-67. 

. 1957b. "Prospect of a ‘Papuan’ Comparative Linguistics,” Bijdragen tot de 

Taal-, Land- en Volkenkunde 113: 70—91. 

. 1957c. "Een tweede grote Papoea-taalgroepering in Nederlands-Nieuw- 

Guinea," Nieuw Guinea Studien 1: 106—18. 

. 1959. "La classification des langues papoues," Anthropos 54: 973-81. 

. 1963. “Le buna’ de Timor: une langue ‘ouest-papoue,’” Bijdragen tot de Taal-, 
Land- en Volkenkunde 119: 387-400. 

Crowley, Terry, and R. M. W. Dixon. 1981. “Tasmanian,” in R. M. W. Dixon and 
Barry J. Blake, eds., Handbook of Australian Languages 2. Amsterdam. 

DuBois, Cora Alice. 1944. The People of Alor. Minneapolis. 

Franklin, К. J. 1973a. “Тһе Gulf Area in the Light of Greenberg’s Indo-Pacific Hy- 
pothesis,” in Franklin, PL C26, cited below. 

















184 5 OCEANIA 





, ed. 1973b. The Linguistic Situation in the Gulf District and Adjacent Areas, Papua 
New Guinea. PL C26. 

Grace, George W. 1968. "Classification of the Languages of the Pacific," in Andrew P. 
Vayda, ed., Peoples and Cultures of the Pacific. Garden City, N.Y. 

Greenberg, Joseph H. 1958. "Report on the Classification of the Non-Austronesian 
Languages of the Pacific," unpublished manuscript. 

. 1960. “Indo-Pacific Etymologies," unpublished manuscript. 

. 1971. "The Indo-Pacific Hypothesis," in CTIL 8. 

Laycock, D. C., and C. L. Voorhoeve. 1975. "History of Research in Papuan Lan- 
guages," in PLNGLS. 

Laycock, D. C., and S. A. Wurm. 1975. "Possible Wider Connections of Papuan 
Languages," in PLNGLS. 

McElhanon, K. A. 1971. "Classifying New Guinea Languages," Anthropos 66: 
120-44. 

. 1978. А Classification of the Languages of the Morobe Province, Papua New 
Guinea. Canberra. 

McElhanon, K. A., and C. L. Voorhoeve. 1970. The Trans— New Guinea Phylum: Explo- 
rations іп Deep-Level Genetic Relationships. PL В16. 

Plomley, N. J. B. 1976. A Word-List of the Tasmanian Aboriginal Languages. Hobart. 

Ray, Sidney H. 1893. "The Languages of British New Guinea," Transactions of the 
Ninth International Congress of Orientalists, London, 1892. 

Ruhlen, Merritt. 1981. "A Comparison of Two Classifications of Papuan Languages," 
unpublished manuscript. 

Schmidt, Wilhelm. 1900—1901. "Die sprachlichen Verháltnisse von Deutsch Neu- 
Guinea," Zeitschrift für Afrikanische und Ozeanische Sprachen, 2 parts, 5: 345—84, 6: 
1-99. 

. 1952. Die tasmanischen Sprachen. Utrecht. 

Voegelin, C. F., and F. M. Voegelin. 1965. Languages of the World: Indo-Pacific fascicle 

ve. AL 7. 

Stephen А. 1971. “Тһе Papuan Linguistic Situation,” іп CTIL 8. 

. 1974. “Papuan Languages,” іп ЕВ 13: 977—78. 

. 1982. Papuan Languages of Oceania. Tübingen. 

, ed. 1975. Papuan Languages and the New Guinea Linguistic Scene. PL C38. 

Wurm, Stephen A., and K. McElhanon. 1975. “Papuan Language Classification 
Problems," in PLNGLS. 

Zide, Norman H. 1985. "Robert Colebrooke and the First Linguistic Notice of the 
Andamanese Languages," unpublished manuscript. 























5.3 AUSTRALIAN 


About 170 Australian languages are still spoken by some 30,000 people. 
There are perhaps another 20,000 Aborigines who no longer speak their 
original language. All but about 50 of these languages appear to be rapidly 
heading for extinction in the next generation or two, and a good number of 
Australian tongues have already disappeared in this, and the previous, 
century. The numerically strongest language is the Western Desert Lan- 
guage, spoken over a wide area of Western Australia by perhaps 5,000 
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ËH N on—Pama-Nyungan 
= Pama-Nyungan 





Map 5.3. The Australian family 


people. Languages in the eastern third of Australia are for the most part 
either extinct or moribund. The Australian languages that have been best 
able to maintain themselves are found in the Northern Territory and West- 
ern Australia. Like the Papuan languages of New Guinea, there are no 
famous Australian languages. They are generally known only to anthropol- 
ogists and linguists who specialize in their study. The geographic distri- 
bution of the Australian family is shown in Map 5.3. 


5.3.1 History of Australian Classification 


Although European explorers landed in Australia as early as 1616, the 
first recording of an Aboriginal language was made only in 1770, when a 
member of Captain Cook's expedition collected some information оп Guugu 
Yimidhirr. The first English settlement, on the present site of Sydney, was 
begun in 1788, and the founding of Hobart in 1803 marked the invasion of 
Tasmania. The history of the study of Australian Aboriginal languages is 
neither long nor complex. Since the English invasion of Australia the Ab- 
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origines have, in the best of times, been the subject of "benign neglect," 
and in the worst of times, the object of periodic acts of genocide. Under 
such conditions the study of their languages could hardly thrive. During 
the nineteenth century investigation of Australian languages was almost 
exclusively the domain of missionaries and amateurs, who produced mate- 
rials that from a modern vantage point leave much to be desired. During 
the first half of the twentieth century, the situation deteriorated during 
what Dixon (1980: 16) calls "the dark ages of Australian Aboriginal policy." 
Most amateur and missionary work ceased, and there was no government- 
sponsored research. Except for an occasional isolated scholar, such as 
Arthur Capell, linguistic work on Australian languages came to a standstill. 
In the 1960's the picture began to change for the better as a consequence of 
the work of three organizations: the Australian National University (and 
later other Australian universities), the Australian Institute of Aboriginal 
Studies, and the Australian branch of the Summer Institute of Linguistics. 

The diversity of Australian languages was recognized only toward the 
end of the eighteenth century; previously it had been widely believed that 
there was but a single Australian language. The first book on an Australian 
language (Awabakal) was published in 1827 by the missionary L. E. Threl- 
keld, who followed it with a grammar of the same language in 1834. Awaba- 
kal became extinct around 1850. 

Australian linguistics in the nineteenth century created few milestones. 
One, however, was the recognition by George Grey in 1841 that all Aus- 
tralian languages have a common origin. Grey had explored parts of West- 
ern Australia and had compiled wordlists from languages across the conti- 
nent. He detected a number of cognates that spanned the entire continent 
and on this basis posited "a community of origin for the aborigines" 
(quoted in Dixon 1980: 12). The second, and final, milestone of the nine- 
teenth century was the publication of Edward M. Curr's The Australian Race 
іп 1886-87. Curr was a sheep farmer north of Melbourne, who collected 
wordlists for almost 500 languages and dialects and on the basis of their 
comparison continued the task begun by Grey of identifying pan-Australian 
cognates. 

The first serious attempt at a genetic classification of Australian lan- 
guages was published in 1919 by the Austrian cleric Wilhelm Schmidt. 
Schmidt's contribution to the classification of Asian and Oceanic languages 
has already been discussed in sections 4.5 and 5.2 above. But Schmidt had 
a particular and enduring fascination for Australian languages, as mani- 
fested in the two monographs he published in 1919 and in the compendium 
on the Tasmanian languages that appeared in 1952, two years before his 
death at the age of 86. Though he never visited Australia, Schmidt gathered 
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together іп Vienna all the published material һе could find оп Australian 
languages (relying heavily on Curr's wordlists). In 1919 he proposed a clas- 
sification that recognized a relatively homogeneous group covering the 
southern two-thirds of Australia and a very diverse Northern group that 
appeared to consist of a number of families related neither to the Southern 
group nor to each other. Тһе diversity of the Northern languages had been 
largely overlooked by nineteenth-century investigators such as Grey and 
Curr, who had really only perceived the uniformity of the large Southern 
oup. 

Although Schmidt may be credited with being the first to recognize the 
diversity of the Northern languages, his denial of the genetic unity of all 
Australian languages did not gain general acceptance. As early as 1923 the 
American anthropologist Alfred L. Kroeber (better known for his work on 
American Indian languages) examined the Curr vocabularies and con- 
cluded that Australian languages did in fact constitute a valid genetic fam- 
ily, though the Northern region was clearly the most diverse area. (Some 30 
years later Greenberg [1953] examined the Curr vocabularies and arrived at 
conclusions identical to Kroeber's.) Capell (1937) also expressed a belief in 
the unity of Australian languages and investigated the neglected Northern 
languages in the early 19405. In 1956, building on the earlier observations of 
Grey, Curr, Schmidt, and Kroeber, Capell proposed more than 50 lexical 
and grammatical cognates distributed throughout the Australian conti- 
nent, north and south. 

In the early 1960's the American linguist Kenneth Hale showed that the 
Eastern portion of Schmidt's diverse Northern group consisted in reality of 
phonologically aberrant Southern languages. In redrawing the boundary 
between Northern and Southern languages, Hale proposed the name 
Pama-Nyungan for the homogeneous Southern group, combining the words 
for ‘man’ from the Northeast and Southwest corners of Australia. Except 
for the exclusion of the Northeastern languages, Schmidt's proposed sub- 
grouping of the Pama-Nyungan family was quite good; it forms the basis of 
our current understanding of Pama-Nyungan classification. 

In 1966 two similar classifications of Australian languages, based on lex- 
icostatistical analysis, were proposed by O'Grady, Voegelin, and Voegelin 
and by O'Grady, Wurm, and Hale. The Australian phylum was divided 
into 29 first-order subgroups, one of which was Pama-Nyungan. The other 
28 groups (some consisting of a single language) were restricted to north 
central and northwest Australia. This lexicostatistic classification was ad- 
mittedly tentative, and later research uncovered a number of errors. No 
one, however, has yet proposed any alternative hypothesis for the higher 
levels of classification. 
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5.3.2 Present Status of Australian Classification 


It is generally accepted today that all the indigenous languages of the 
Australian continent, including the language of the western Torres Straits 
(Kala Lagaw Ya = Mabuiag), but excluding Meriam (a Papuan language 
spoken in the eastern Torres Straits) and the extinct Tasmanian languages, 
are genetically related. Dixon (1980: 225) expresses reservations about the 
affinity of Tiwi and Djingili with the rest of the languages, but concedes 
that they may yet prove to be Australian. The majority of scholars already 
accept them as such. 

Within this ancient Australian phylum it is possible to discern many 
low-level genetic groups consisting of a handíul of languages (e.g. Yiwai- 
djan, Gunwinyguan, West Barkly, Daly) as well as one low-level genetic 
group encompassing most Australian languages (i.e. Pama-Nyungan). Sev- 
eral peculiarities of the Australian situation, such as the extreme pho- 
nological homogeneity of all Australian languages and their lexical diver- 
sity, have so far prevented any clarification of the internal relationships of 
the Australian phylum. Dixon (1980: 226) even questions the validity of 
Pama-Nyungan as a genetic subgroup of the Australian phylum, but most 
scholars continue to regard it as a valid subgroup at some level. 

The Australian classification followed here is based on Walsh and Wurm 
(1981), which is itself a revised version of the 1966 lexicostatistic classifica- 
tion discussed above. Following the advice of Paul Black (1982), I have mod- 
ified the Walsh and Wurm classification by (1) condensing the number of 
distinct languages listed (here as elsewhere in this volume, one dialect has 
often been chosen to represent a language that may have several other 
dialects), (2) replacing the names (or their spelling) with others in a good 
number of instances (Australian language names have numerous variant 
spellings, sometimes a dozen or more, and there is no accepted standard- 
ization), and (3) altering certain details of the subgrouping. Echoing Dixon's 
sentiments, Black finds many aspects of the present classification woefully 
inadequate, but no better proposals are currently available. We may hope 
that the current revival of interest in the classification of Australian lan- 
guages, manifested in the work of Dixon, Black, and others, will eventually 
lead to an even better understanding of the internal relationships of this 
unique family. 

The Australian phylum consists of 15 isolated languages (Enindhilya- 
gwa, Ndjébbana, Yanyuwa, Gagudju, Kungarakany, Mangarayi, Mingin, 
Nakkara, Nunggubuyu, Tiwi, Waray, Limilngan, Umbugarla, Gunbudj, 
Murrinh-Patha) and 15 groups. Leaving aside the 15 isolated languages, the 
structure of the rest of the Australian phylum is illustrated in List 5.8. 
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List 5.8. The Australian Family (Walsh & Wurm 1982, Black 1982) 


AUSTRALIAN: [XV PAMA-NYUNGAN] 
I YIWAIDJAN K GUMBAYNGGIRIC 
П Mangerrian L YUIN-KURIC 
Ш GUNWINYGUAN M WIRADHURIC 
IV BURARRAN N Baagandji 
V MARAN O ТҮОТАҮОТІС 
VI WEST BARKLY P tKULINIC 
ҮП GARAWAN Q tNGARINYERIC- 
VIII LARAGIYAN YITHAYITHIC 
IX DALY R KARNIC 
X DJAMINDJUNGAN S WAGAYA-WARLUWARIC 
ХІ DJERAGAN T GALGADUNGIC 
XII BUNABAN U ARANDIC 
XIII NYULNYULAN V SOUTH-WEST: 
XIV WORORAN 1 NGUMBIN 
XV PAMA-NYUNGAN: 2 MARNGU 
А YUULNGU 3 NGAYARDA 
B TANGIC 4 MANTHARDA 
C PAMAN 5 KANYARA 
D ҮАГАМИС 6 TKARDU 
E YIDINYIC 7 WADJARI 
F DYIRBALIC 8 WATI 
G NYAWAYGIC 9 NGARGA 
H MARIC 10 tNyungar 
I WAKA-KABIC 11 Ngadjunmaya 
J TDURUBALIC 12 YURA 
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6 
North and South America 


As Rowe (1974) points out, grammars of New World languages began to 
appear in the sixteenth century, no later than those of European, Asian, 
and African languages. During the sixteenth century at least six New World 
grammars appeared (Tarascan, Quechua, Nahuatl, Zapotec, Mixtec, Tupi), 
and the seventeenth century saw the appearance of 15 more. Only two of 
the New World grammars published before 1700, however, dealt with lan- 
guages spoken north of Mexico (Timucua, Massachusett). 

During the eighteenth century linguistic materials accumulated in ever- 
increasing numbers, and finally in the nineteenth century problems of clas- 
sification were seriously investigated for the first time. Toward the end of 
the nineteenth century, John W. Powell (1891) classified the languages north 
of Mexico into 58 independent families, many of which contained but a 
single language. An even greater number of low-level genetic groups were 
identified in Central and South America. Brinton (1891) recognized 73 in 
South America alone. At the beginning of the twentieth century, more than 
150 distinct New World families had been identified, though much of 
South America was still poorly known. 

During the first half of the twentieth century, several scholars attempted 
to reduce the total number of families by combining certain groups into 
more comprehensive stocks. Chief among these scholars was Edward Sapir, 
who suggested in 1929 that the languages north of Mexico could be re- 
duced to just six phyla, and those of Central America to 15. 

In South America, however, the number of distinct families increased 
from 73 (Brinton 1891) to 77 (Rivet 1924) to 94 (Loukotka 1935) to 114 (Lou- 
kotka 1944) to 117 (Loukotka 1968). Furthermore, баріг5 (1929) article 
turned out to be a watershed in the classification of North American lan- 
guages; most Amerindian scholars after Sapir (including his students) 
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came to feel that he had gone too far in consolidation. In 1964 a conference 
on the classification of American Indian languages north of Mexico re- 
placed Sapir's six phyla with 16 independent families; 12 years later another 
conference on a similar theme proposed 62 distinct genetic groups north of 
Mexico (see Campbell апа Mithun 1979). Combining Loukotka's (1968) 117 
South American families with Campbell and Mithun's (1979: 3—69) 62 North 
American families, and adding perhaps 20 families restricted to Central 
America, brings us to a total of roughly 200 distinct genetic groups, accord- 
ing to the current consensus of Amerindian scholars. 

This is certainly a startling figure. If the rest of the world's languages can 
be subsumed in a dozen or so families, as I have indicated in this volume, or 
even in the several dozen families that more conservative linguists might 
postulate, the figure does not approach 100, much less 200. The mystery is 
compounded by the fact that man's presence in the New World is of recent 
date compared with the rest of the world. On these grounds alone one 
should expect less diversity in the New World than elsewhere. 

One linguist has argued for a radically different view of New World lan- 
guages since the mid-1950's. After completing his work on African classifi- 
cation (see Chap. 3), Joseph Greenberg turned his attention to the two 
areas of the world for which there was little understanding of the genetic 
relationships of languages, New Guinea and South America. (His conclu- 
sions regarding the New Guinea area are discussed in section 5.2.) 

Soon after Greenberg began investigating South American Indian lan- 
guages in the mid-1950's, he realized that they could not be fruitfully inves- 
tigated by themselves, since lexical cognates characterizing these languages 
were also found in the languages of Central America and much of North 
America. After expanding his purview to the entire New World, Greenberg 
soon concluded that New World languages grouped themselves into but 
three genetic units: Eskimo-Aleut, Na-Dene, and Amerind. He reported his 
findings in a paper delivered in 1956, but not published until 1960. Green- 
berg continued to gather Amerindian data during the 1960's and especially 
the 1970's, and finally published the evidence supporting his tripartite clas- 
sification in 1987. It is in terms of this classification that New World lan- 
guages will be examined in this volume. 
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6.1 ESKIMO-ALEUT 


There are about 10 Eskimo-Aleut languages, with a total of 85,000 speak- 
ers. Aleut is spoken in the Aleutian Islands (Alaska) by some 700 people; 
an additional 1,500 Aleuts no longer speak the language. It is also spoken 
in the Commander Islands (U.S.S.R.) by roughly 20 members of the total 
Aleut population of 400. Several varieties of Eskimo are spoken across the 
northern perimeter of North America, from Alaska in the west through 
Canada to Greenland in the east. There are, in addition, about 600 speakers 
of three Eskimo languages spoken in northeastern Siberia (U.S.5.R.). The 
geographic distribution of the Eskimo-Aleut family is shown in Map 6.1. 

It is sometimes forgotten that the Eskimo were known in Europe cen- 
turies before Columbus first reached the New World. The Vikings landed in 
Greenland early іп the ninth century A.D. and by the end of the century 
established permanent settlements that would last into the fifteenth cen- 
tury. Since Greenland was already inhabited by Eskimo before the arrival of 
the Vikings, knowledge of the Eskimo gradually filtered back to Europe via 
Norse travelers. 

The study of the Eskimo language(s), however, dates only from the mid- 
eighteenth century, when Paul Egede published the first (Greenlandic) 
Eskimo dictionary (1750) and grammar (1760). These works were described 
by Hoijer (1976: 4) as "the foundations of Eskimo linguistic studies." The 
enormous geographic distribution of the Eskimo languages was not at first 
appreciated, but by 1836 Albert Gallatin was able to sketch the boundaries 
of the Eskimo family “with considerable precision" (Powell 1891). Тһе sec- 
ond landmark in Eskimo linguistics was Samuel Petrus Kleinschmidt's 
Greenlandic grammar (1851) and dictionary (1871), which showed a no- 
table advance in linguistic sophistication over Egede's works. 

Apparently the first person to perceive the affinity of Eskimo and Aleut 
was the Dane Rasmus Rask, whose work on Indo-European was discussed 
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in section 2.3. According to Pedersen (1931: 136), Rask made his observa- 
tion around 1818 іп а manuscript that remained unpublished for almost а 
century. In any event, by the time of the Powell classification (1891), it was 
generally agreed that Eskimo and Aleut formed one genetic group. 

Early in the twentieth century it was realized that there were basically 
two varieties of Eskimo, an Eastern type (Inuit or Inupiag) and a Western 
type (Yupik). The border between these two varieties was at first placed be- 
tween Alaska and Canada, but in the 1950's it became apparent that the true 
border was located at Norton Sound on the west coast of Alaska. 

There have been numerous attempts to link the Eskimo-Aleut family 
with other families. The question of the external connections of Eskimo- 
Aleut is taken up in section 7.3. 


List 6.1. The Eskimo-Aleut Family 
(Krauss 1976, 1979) 


ESKIMO-ALEUT: 
I Aleut 
II ESKIMO: 
A INUIT 
B YUPIK: 
1 ALASKAN 
2 SIBERIAN 
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Тһе internal relationships of the Eskimo-Aleut family аге fairly clear and 
uncontroversial. There і а sharp cleavage between Aleut апа Eskimo anda 
clear distinction between Inuit and Yupik Eskimo. The classification fol- 
lowed here, based on Krauss (1976, 1979), is shown in List 6.1. 
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6.2 NA-DENE 


The Na-Dene phylum consists of three independent languages (i.e. 
Haida, Tlingit, Eyak) and the Athabaskan family. Their geographic distri- 
bution is shown in Map 6.2. There are approximately 300 speakers of Haida 
on the Queen Charlotte Islands (off the west coast of Canada) and in the 
Alaskan panhandle. There are an additional 1,700 Haida who no longer 
speak the language. Of the total Tlingit population of 10,000, only 2,000 still 
speak the language. Most speakers reside in southeastern Alaska, though 
there are perhaps 100 speakers in Canada. Originally spoken along the 
southeastern coast of Alaska, Eyak had only two surviving speakers, both 
old, in the late 19705 and may now be extinct. 

About 30 Athabaskan languages, with an estimated 200,000 speakers, are 
spoken in most of the Alaskan interior, much of western Canada, and the 
American Southwest (i.e. Arizona and New Mexico). Formerly Athabaskan 
languages were also spoken on the northern California and Oregon coasts, 
but these languages are now either extinct or bordering on extinction. The 
numerically largest Athabaskan language is Navajo, spoken in New Mexico 
and Arizona by 130,000 people. An additional 20,000 Navajo no longer 
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speak the language, which is Ше only American Indian language north of 
Mexico with more than 100,000 speakers. 


6.2.1 History of Na-Dene Classification 


According to Krauss (1964: 127), information on all three constituents of 
the Na-Dene phylum (Haida, Tlingit, Athabaskan) was first published іп 
the eighteenth century. Edward Thompson ss Chipewyan vocabulary (1742) 
was apparently the first linguistic work on an Athabaskan language; mate- 
rial on Tlingit (1786) and Haida (1791) was gathered toward the end of the 
century. The possibility that Tlingit and Haida are genetically related was 
first entertained at the close of the eighteenth century. А Russian linguist, 
Rezanov, collected the first Eyak data in 1805. Although he perceived simi- 
larities between Eyak, Тапаіпа (ап Athabaskan language), and Tlingit, he 
attributed them to borrowing rather than to common origin. In 1816 Ade- 
lung and Vater discussed a possible relationship between Athabaskan, 
Eyak, апа Tlingit, but reached по firm conclusions. 

By the mid-nineteenth century Russian linguists had arrived at an under- 
standing of the Na-Dene phylum that differed little from that reached by 
Americans a century later (see Krauss 1976: 334). With the sale of Alaska to 
the United States in 1867, however, the Na-Dene languages passed out of 
the Russian zone of inquiry into the American. 

Gallatin, in his 1836 classification of North American Indian languages, 
recognized the Athabaskan family, though only its Northern varieties. In 
addition to the large Athabaskan-speaking territory in Alaska and Western 
Canada, there are two isolated Athabaskan enclaves far to the south. Pacific 
Coast Athabaskan was spoken on the Oregon-California coast; today these 
languages are all either extinct or moribund. Apachean, comprising Navajo 
and various dialects of Apache, is found in Arizona and New Mexico. Pa- 
cific Coast Athabaskan and Apachean were first connected with the North- 
ern languages by William Turner and Horatio Hale in the 1850's. 

In the Powell classification of 1891, Athabaskan, Tlingit, and Haida were 
listed as independent families, though Powell mentioned the possibility 
that Tlingit and Haida might be related. The Eyak language did not appear 
in Powell's classification or in any other American work of this period; both 
the Eyak themselves and the earlier Russian sources on their language were 
overlooked by American investigators until the 193095. 

In 1894 Franz Boas concluded on "structural" grounds that Tlingit and 
Haida were genetically related, though he admitted that lexical cognates 
were few. In the American literature the Na-Dene hypothesis was first for- 
mulated by Edward Sapir in 1915. Building on the earlier observations of 
Boas and John Swanton, Sapir proposed close to 100 cognates shared by 
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Athabaskan and Tlingit or Haida (or both), which accounted, according to 
Sapir, for only a third of those he had discovered. Sapir's proposals stirred 
up guite a controversy (see Goddard 1920), for the scholarly climate, then 
as now, was not receptive to such distant relationships. Boas, in fact, had 
become disenchanted with genetic classification in general, and with the 
Athabaskan-Tiingit-Haida connection in particular, coming to feel that it 
was impossible to distinguish borrowing and areal convergence from true 
genetic affinity. In this regard his views contrasted sharply with those of 
Sapir, for whom the Na-Dene family was but one of many proposals con- 
solidating what had previously been considered independent languages or 
families. Other examples of баріг5 work in this vein are discussed іп the 
tollowing section (6.3). 

The question of whether it is possible to distinguish genetic affiliation 
from borrowing continues to divide linguists to the present day. “Diffu- 
sionists,” such as Boas, felt it was impossible to differentiate borrowed 
traits from inherited ones, particularly at deeper levels of relationship. “Ge- 
neticists,” such as Alfred Kroeber and Sapir, did not deny that the diffusion 
of traits occurs. Rather, they contended that it was never of such a massive 
or ambiguous nature that it was impossible to distinguish the inherited core 
of a language from later superficial accretions. Some scholars did even- 
tually turn this issue into an either/or proposition. The dichotomy was un- 
fortunate since, as Haas (1966: 150) stresses, “The truth of the matter is 
that both cognates and borrowed words have a story to tell about earlier 
connections.” 

Eyak was rediscovered by Frederica de Laguna in 1930 and thus came to 
the attention of American scholars for the first time. Sapir immediately rec- 
ognized that Eyak was related to, but not a member of, the Athabaskan fam- 
ily, and subsequently Fang-kuei Li and Michael Krauss reached the same 
conclusion. 

Since 1950 the Na-Dene hypothesis has received further support from 
the work of Dell Hymes and especially Heinz-Jürgen Pinnow. Quite re- 
cently Krauss (1979) and Levine (1979) have questioned the inclusion of 
Haida іп Na-Dene, but Greenberg (1987) criticizes their demurral. 


6.2.2 Present Status of Na-Dene Classification 


The nucleus of the Na-Dene phylum is the Athabaskan family, whose 
close-knit genetic unity has been recognized for over a century. Clearly re- 
lated to the Athabaskan family, but not a member of it, is the Eyak language, 
which is now on the verge of extinction. It is also universally accepted that 
Tlingit is distantly related to the Athabaskan-Eyak nucleus, but opinion is 
divided on whether Haida too is to be included in the Na-Dene complex. 
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Sapir, Hymes, Ріппоуу, and Greenberg support its inclusion; Krauss апа 
Levine remain unconvinced. 

Perhaps the most difficult problem in Na-Dene taxonomy concerns the 
internal relationships of the close-knit Athabaskan family. Krauss (1976, 
1979) doubts that any valid internal subgrouping can be established for 
Athabaskan. Nevertheless, based on the sources listed in the bibliography 
to this section, we can make certain observations about the internal struc- 
ture of the family. (1) Of the three geographic groups that make up Athabas- 
kan, only Pacific Coast Athabaskan and Apachean constitute valid genetic 
nodes. (2) The Northern group comprises several primary branches, though 
the precise number and their constituent languages are difficult to deter- 
mine because of the homogeneity of Northern Athabaskan and the paucity 
of materials on some languages. (9) Apachean is both more homogeneous 
and more closely related to the Northern languages than is the Pacific Coast 
group. (4) Among the Northern groups the greatest diversity lies in the 
west (mainly Alaska), whereas in the east (Canada) the diversity is consid- 
erably less, with mutually intelligible dialects covering large areas. (5) 
Within the Pacific Coast group there is a clear division into Oregon and 
California branches. (6) Within Apachean a singlelanguage, Kiowa Apache, 
is opposed to the remainder of the languages. (7) The Apachean and Pacific 
Coast groups appear to have broken away from different sections of the 
Northern dialect complex. Specifically, Apachean shows greatest affinity 
with Sarcee, and Pacific Coast Athabaskan with Tahltan. The Athabaskan 


List 6.2. Тһе Na-Dene Family (Sapir 1929 and others) 


NA-DENE: (2 ATHABASKAN] 
I Haida h CANADIAN: 
H ATHABASKAN-TLINGIT: і HAN-KUTCHIN 
А Tlingit ii HARE-CHIPEWYAN 
B ATHABASKAN-EYAK: іп BEAVER-SEKANI 
1 Eyak iv CARRIER- 
2 ATHABASKAN: CHILCOTIN 
a TANAINA-AHTNA v TKwalhioqua 
b INGALIK-KOYUKON vi Sarcee 
c TANANA-UPPER i APACHEAN: 
KUSKOKWIM i Kiowa Apache 
d TUTCHONE ii NAVAJO-APACHE 


e TAHLTAN-KASKA 
f TTsetsaut 
g PACIFICCOAST: 

i OREGON 

ii CALIFORNIA 





200 6 NORTH AND SOUTH AMERICA 


classification adopted here attempts to represent these seven statements, as 
well as а host of low-level relationships described іп Krauss (1976, 1979). 
The subgrouping for Na-Dene as a whole follows Sapir (1929), with the ad- 
dition of Eyak in the appropriate position (List 6.2). 
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6.3 AMERIND 


Before the European invasion of the New World in the sixteenth century, 
languages belonging to the Amerind phylum were spoken from what is to- 
day northern Canada to the tip of South America (see Maps 6.3—6.5). Dur- 
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ing the past 500 years many of the Native American tribes and languages 
have disappeared, often without a trace. Today ап Indo-European language 
(English, Spanish, or Portuguese) is politically dominant in every country 
of North, Central, and South America, and the native Indian languages еп- 
Joy second-class status, or worse. We can only roughly estimate the number 
of surviving native languages and their number of speakers, particularly for 
South America, which is much more poorly documented than North and 
Central America. My estimate is that there are currently about 600 surviv- 
ing Amerind languages, spoken by a total of roughly 18 million people. 
Were all the speakers divided evenly among the languages, the average 
would be almost 30,000 speakers per language. But the actual distribution 
of speakers among the languages is quite different. Seventeen Amerind 
languages have more than 100,000 speakers (see Table 6.1), and they ac- 
count for over 90 percent of the Arnerind population. 

The remaining Amerind languages would average about 1,400 speakers 
each. But even this figure is deceiving in that, although roughly 20 percent 
of Amerind languages are spoken north of Mexico (United States and Can- 
ada), these 140 or so languages account for less than 2 percent of the Amer- 
ind population. This fact is well illustrated in Table 6.1, where we notice 
that all the languages listed are spoken south of the United States. (Navajo, 
as we have seen, has more than 100,000 speakers, but it belongs to the Na- 
Dene phylum.) Many North American Indian languages have only a hand- 
ful of speakers left (usually old people, the younger generation having 


Table 6.1. Amerind Languages with More Than 100,000 Speakers 


Millions 
Language" of speakers Location 
Ouechua 7.00 Colombia, Ecuador, Peru, Bolivia 
Guarani 3.00 Paraguay, Brazil, Argentina 
Aymara 1.50 Peru, Bolivia 
Nahuatl (= Aztec) 1.00 Southern Mexico 
Quiche (Mayan) ‚87 Guatemala 
Yucatec (Mayan) .53 Southern Мехісо 
Zapotec ‚50 Southern Мехісо 
Mapudungu (= Araucanian) 4 Chile 
Cakchiguel (Mayan) 41 Guatemala 
Mazahua .35 Southern Mexico 
Totonac ‚27 Southern Мехісо 
Kekchi (Мауап) ‚27 Guatemala 
Mam (Mayan) .26 Guatemala 
Mixtec 25 Southern Мехісо 
Otomi 222 Southern Мехісо 
Mazatec .12 Southern Mexico 
Tzotzil (Mayan) .11 Southern Mexico 
TOTAL 17.10 


“Some of these “languages” аге in reality complexes of mutually unintelligible dialects, e.g. 
Quechua, Nahuatl, Otomi, Zapotec, Mixtec. 
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adopted English) and аге clearly destined for extinction іп the very near 
future. 

Despite having limited numbers of speakers, a number of North Ameri- 
can Indian languages are well known, having been borrowed into En- 
glish as place names, e.g. Illinois, Massachusett, Cheyenne, Alabama, 
Omaha, Dakota, Biloxi, Wichita, Mohave, Kalispel. Others have become fa- 
mous around the world via Western movies, e.g. Comanche, Muskogee 
(= Creek), Cherokee, Mohawk. 

Information on the numerous families that make up the Amerind phy- 
lum (e.g. Algonguian, Iroguoian, Uto-Aztecan) may be found in Table 8.2. 


6.3.1 History of Amerind Classification 


As I mentioned above, the collection of linguistic information for New 
World languages did not lag behind that for the rest of the world. During 
the sixteenth century the first wordlists were gathered by European ex- 
plorers, and a few grammars appeared. The seventeenth and eighteenth 
centuries saw a steady increase in the number of grammars, mostly the 
work of missionaries. Finally, in the nineteenth century, problems of ge- 
netic classification were systematically attacked for the first time. 

The first New World language to be recorded was apparently Laurentian 
(a Northern Iroquoian language), a wordlist of which was collected during 
Jacques Cartier's first voyage to the Gulf of Saint Lawrence in 1534. (The 
wordlist may actually have been taken down some time later, since Cartier 
took two Indians back to France to meet the king.) There is some dispute 
whether Laurentian is an earlier stage of the modern Huron language or a 
language closely related to Huron that became extinct. Regardless, it was 
clearly a Northern Iroquoian language. 

The Iroquoian family was also apparently the first genetic group in 
North America to be identified by European scholars. As early as 1635 Paul 
LeJeune recognized it, or at least its Northern branch. The Southern branch, 
consisting of a single language (Cherokee), was added to the family in 1797 
by Benjamin S. Barton, though the relationship had perhaps been per- 
ceived by David Zeisberger in 1768. Barton also included the Tuscarora lan- 
guage in Iroquoian for the first time; though geographically Southern 
(originally spoken in eastern North Carolina), Tuscarora belongs with the 
Northern branch, not with neighboring Cherokee. 

The other major linguistic family indigenous to the Atlantic seaboard, 
the Algonquian family, was no doubt recognized early, but I have not been 
able to determine either by whom or at what date it was first noticed. In 
northern Florida a language was spoken that was neither Iroquoian nor Al- 
gonquian; the genetic affinity of Timucua (now extinct) has been the sub- 
ject of lively controversy over the years, and we shall return to this question 
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below. A grammar of Timucua, published in 1614, is the earliest North 
American grammar listed in Rowe (1974). 

At the beginning of the nineteenth century, in North America, only the 
languages of the Atlantic Coast were documented at all well. West of the 
Mississippi River there were known to be a number of tribes, but almost 
nothing was known of their languages. Work on California languages was 
begun only in the nineteenth century by a Spanish priest, Felipe Arroyo de 
la Cuesta. 

Men such as Thomas Jefferson recognized the value of wordlists for the 
detection of genetic relationships (and hence of migratory paths of the Na- 
tive American peoples). Jefferson explicitly instructed Lewis and Clark to 
collect such wordlists as they could during their expedition across the con- 
tinent from 1803 to 1806. The materials they gathered were deposited in 
Washington, D.C., upon their return, but were quickly lost and have never 
been found. 

During the early nineteenth century typological classification was more 
common than genetic classification, and indeed the two were often not 
carefully distinguished. Stephen Duponceau and John Pickering, both law- 
yers by profession and philologists by avocation, believed that all New 
World languages were characterized by polysynthesis (i.e., what would 
constitute an entire sentence in European languages was contained іп а 
single word in so-called polysynthetic languages), a trait that was pre- 
sumed to be absent from the languages of the rest of the world. It was not 
until the end of the century that this belief in structural uniformity was fi- 
nally laid to rest, largely through the work of Eranz Boas. Polysynthesis is 
by no means characteristic of all New World languages, nor is it entirely 
absent from the rest of the world. 

The first serious attempt at a classification of North American languages 
was made by Albert Gallatin in 1836. Sometimes referred to as the Father of 
American Linguistics, Gallatin reduced the 81 tribes on which he had data 
to 28 families by comparing basic vocabulary. Twelve years later he in- 
creased the number to 32 for North America north of Mexico, but excluding 
California, for which no materials were yet available. Gallatin's classification 
is still recognized as a turning point in Amerindian classification; the 
American linguist Mary Haas (1969b: 244) has called it “a landmark in the 
history of the genealogical classification of North American languages 
[that] was not superseded until the work of Powell and his coworkers com- 
pleted what Gallatin had begun.” Gallatin, we might add, believed that all 
North American languages were ultimately related. 

The last half of the nineteenth century saw the gradual accumulation o£ 
data on languages of the western United States, in particular the previously 
unknown area of California. Toward the end of the century, there devel- 
oped a race between two men to compile the first comprehensive classifica- 
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Ноп of North American languages. Daniel Brinton was а doctor who һай 
been intensely interested in the American Indian all his life. He eventually 
guit his medical practice to devote his full time to the study of American 
Indian languages, after 1886 as a professor at the University of Pennsyl- 
vania. Brinton's competitor was an army major, geologist, and ехріогет who 
had founded the Bureau of American Ethnology in 1879 and served as its 
first director, John Wesley Powell. Brinton's classification was published in 
1891; though Powell's classification bore the same date, it did not in fact ap- 
pear until 1892. The two classifications were only roughly similar in scope, 
since Brinton's covered all of North and South America; Powell's was re- 
stricted to North America north of Mexico. Furthermore, Brinton produced 
his classification by himself, whereas Powell may be more accurately de- 
scribed as the supervisor, rather than the author, of the classification that 
bears his name, as we shall see presently. 

When Brinton published The American Race in 1891, he intended it as 
“the first attempt at a systematic classification of the whole American race 
on the basis of language" (p. ix). Brinton estimated some 80 families for 
North America and an equal number for South America, but all of them, he 
believed, shared a fundamental unity: "The psychic identity of the Ameri- 
cans is well illustrated in their languages" (p. 55). Brinton divided all the 
New World languages into five geographic groups; List 6.3 shows his classi- 
fication, with representative families from each of the groups. 

Brinton's classification was the first for South America and formed a 
foundation for all future work in this area. For North America the classifica- 
tion was soon eclipsed by Powell's, which was based on more complete in- 
formation to which Brinton had been denied access. More precisely, Brin- 
ton had asked to see the materials that had been gathered at the Bureau of 
American Ethnology. Powell responded that Brinton could see the materials 
on condition that he not use them in his work, "a proviso scarcely so liberal 
as I had expected," lamented Brinton (1891: xii). 

Powell's classification was a group effort by the staff of the Bureau of 
American Ethnology, and in the actual determination of the classification 
Powell himself played an inconsequential role. According to Kroeber, the 
final classification “was made for Powell, who was a geologist and an army 
major, by Henshaw the ornithologist when Powell found that he would 
never get his philologist-linguists like Gatschet, Hewitt, and Pilling to come 
through with the commitment of a classification" (quoted in Campbell & 
Mithun 1979: 12). The Powell classification, as it became known, identified 
58 families north of Mexico, over half of which were language isolates (i.e. 
single languages with no known relatives). Furthermore, fully two-thirds of 
the 58 families were found west of the Rocky Mountains. Thirty-two lan- 
guages were classified as language isolates: Adaizan, Atakapa, Beothuk, 
Chimariko, Tsimshian, Chinook, Chitimacha, Chumash, Esselen, Kala- 
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List 6.3. New World Language Families (Brinton 1891; partial list) 


I 


NORTHATLANTIC: 
A ESKIMO-ALEUT 
B Beothuk 

C ATHABASKAN 
D ALGONQUIAN 
E IROQUOIAN 

F MUSKOGEAN 

G COAHUILTECAN 
H CADDOAN 

I SIOUAN 

J Kiowa 


IV 


SOUTH PACIFIC: 
A CHIBCHAN 

B PAEZAN 

C TIMOTE 

D Cuna 

E CHOCO 

F Quechua 

G Aymara 


II 


NORTH PACIFIC: 
A. Tlingit 
B Haida 
C SAHAPTIN- 
МЕХ, PERCE 
D SALISH 
E CHIMAKUAN 
F PALAIHNIHAN 
G YANAN 
H MAIDUAN 
I COSTANOAN 
J Karok 
K Yuki 
L YUMAN 
M TANOAN 
N Zuni 
O Keres 


V 


SOUTH ATLANTIC: 


A TUPI 

B GE 

C ARAWAKAN 
D CARIB 

E PANOAN 

F TACANAN 
G ZAPAROAN 
H JIVAROAN 

I Mapudungu 


IH 


CENTRAL: 

A UTO-AZTECAN 
B OTOMIAN 

C Tarascan 

D TOTONACAN 

E ZAPOTECAN- 

MIXTECAN 

F MIXE-ZOQUE 

G CHINANTECAN 
H MANGUEAN 

I TEQUISTLATECAN 
J MAYAN 

K Huave 

L Subtiaba 
M Jicaque 
N Xinca 


puya, Karankawa, Keres, Kiowa, Kutenai, Tlingit, Coos, Klamath, Natchez, 
Karok, Salinan, Shasta, Sahaptin, Haida, Takelma, Timucua, Tunica, Ton- 
kawa, Yuchi, Washo, Yurok, Wiyot, Zuni. In addition to these isolates, the 
Powell classification identified the 26 families in List 6.4. 

The Powell classification has been widely praised over the years and is 
generally regarded as the second important classification of North Ameri- 
can languages, completing what Gallatin had begun a half-century earlier. 
Edward Sapir called it a “monumental work, . . . the cornerstone of the lin- 
guistic edifice in aboriginal North America” (quoted in Hoijer 1976: 9). Re- 
cently both Lamb (1964) and Campbell and Mithun (1979) have suggested 
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List 6.4. Language Families North of Mexico (Powell 1891) 
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ESKIMO-ALEUT PALAIHNIHAN COSTANOAN 
ATHABASKAN POMO WINTUN 
ALGONOUIAN YOKUTS 
CHIMAKUAN YUKI-WAPPO 
WAKASHAN COAHUILTECAN MUSKOGEAN 

SALISH CAYUSE-MOLALA TANOAN 

CADDOAN YAKONAN NUMIC 

SIOUAN MAIDUAN PIMIC-TARACAHITIC- 


IROQUOIAN 








MIWOK 


CORACHOL 











that the Powell classification is not as praiseworthy as many believe. A ge- 
netic classification may err in two ways. The first is to group languages that 
do not belong together; the second is not to combine languages (or groups 
of languages) that do belong together. On the first count the Powell classifi- 
cation has often been praised for the fact that all 58 "Powell units," as they 
are sometimes called, are valid genetic nodes. Lamb (1964: 119) objects that 
this is hardly deserving of praise "since, with the exception of Salish and 
his Piman (Pimic plus Taracahitic plus Coric), every grouping made was of 
languages so closely related that there could be no doubt of a recent com- 
mon origin." 

The main defect of the Powell classification is that it failed to recognize 
higher-level groupings, some of which had already been demonstrated in 
the literature. In this regard Powell's classification is less satisfactory than 
that of Brinton, who not only identified the Uto-Aztecan family, but gave it 
its name, remarking that "the relationship of these numerous bands is un- 
questionable" (Brinton 1891: 118). According to Powell (1891: 175), the rela- 
tionship of Aztec to the more northerly Uto-Aztecan languages had been 
claimed by Johann Buschmann (among others), “but this relationship has 
yet to be demonstrated." Lamb (1964: 115) points out that Powell was con- 
fused on this point: "Although Buschmann did not include Aztec in the 
group [Uto-Aztecan], he presented sufficient evidence to show that it be- 
longed." Though Buschmann did join the Numic languages with the Pimic, 
Taracahitic, and Corachol groups of northern Mexico, it was not until 1879 
that Gatschet explicitly added Aztec to the group, a position accepted by 
Brinton but not by Powell. 

Furthermore, Powell did not combine Miwok and Costanoan, though 
their relationship had been demonstrated by Robert Latham in 1856. Fi- 
nally, we may note that Brinton, but not Powell, recognized the affinity of 
Yuman, Seri, and Tequistlatecan, which are today considered members of 
the Hokan phylum. 
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In sum, the Powell classification is perhaps neither as perfect as support- 
ers sometimes contend nor as flawed as some critics would suggest. It ac- 
complished essentially what it set out to do, namely, to delineate all of the 
linguistic families of North America. As Powell made clear, it was not in- 
tended as a final classification, but rather as a beginning. In that capacity it 
served its purpose admirably. 

The appearance of the Powell classification (and the less influential Brin- 
ton classification) in 1891 brought to a close the initial phase in the genetic 
classification of New World languages.* All of North America (north of 
Mexico) had been fairly well charted and the languages reduced to 58 ge- 
netic units. Central and South America were less well documented, but 
here too Brinton's work constituted a baseline for future research. The sec- 
ond phase in the study of Amerindian classification took place during the 
period 1900-1930, but it was the second decade of the century that wit- 
nessed the most remarkable and controversial discoveries. This phase was 
characterized by a gradual reduction in the number of North American 
families; by 1929 Edward Sapir had managed to reduce the 58 Powell units 
to just six stocks. | 

As I mentioned above, certain of the Powell units had already been con- 
solidated before 1891; Miwok-Costanoan had been established in 1856 and 
Uto-Aztecan by 1879. So the period of consolidation really overlaps some- 
what with the period of discovery. Nevertheless, relatively little consolida- 
tion took place until the twentieth century. Some of the earliest intimations 
of more comprehensive genetic groupings are due to the German-Ameri- 
can anthropologist Franz Boas, who suggested in 1894 that "structural 
similarities" indicated a common origin for Athabaskan, Tlingit, and Haida. 
On similar grounds Boas saw a common origin for the Salish, Wakashan, 
and Chimakuan families. But by the time Sapir offered lexical support for 
grouping Athabaskan, Tlingit, and Haida in a family that he named Na- 
Dene (see section 6.2), Boas had become disillusioned with genetic classifi- 
cation, having come to feel that it was impossible to separate borrowings 
from true cognates. Although Boas remained a towering figure in Ameri- 
can anthropology until his death in 1942, his main contribution was not in 
the area of linguistic classification. Indeed, in this field he appears to have 
had a negative impact. 

One might well mark the beginning of the "reductionist phase" of Amer- 
indian classification, as it is sometimes called, with the arrival of Alfred 
Kroeber at the University of California at Berkeley in 1901. With the zeal 
characteristic of a recent Ph.D. (from Columbia University), Kroeber quickly 
set to work gathering wordlists for all California Indian languages, with the 
hope that their comparison might lead to some consolidation in the 22 fami- 


“See Voegelin & Voegelin 1967 for a discussion of the three phases of Amerindian 
classification. 
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lies that Powell had postulated for California. Тһе reductionist phase began 
slowly, and the first consolidation (linking Shasta and Palaihnihan) was 
proposed іп 1905 by Kroeber's colleague and future collaborator, Roland 
Dixon. The following year Boas assigned the extinct Adaizan language to 
the Caddoan family, and Natchez to Muskogean. In 1910 Dixon added 
Chimariko to the Shasta-Palaihnihan group, and Kroeber combined Miwok 
and Costanoan, returning to a position first advocated by Latham in 1856. 
In 1911 Kroeber noted certain “structural similarities” between Wiyot and 
Yurok, spoken side by side on the coast of northern California and, accord- 
ing to Powell, related neither to each other nor to any other language. 

In 1913 the reductionist phase began in earnest; the year was also a turn- 
ing point in Amerindian classification, and the issues raised then continue 
to reverberate to the present day. Two articles in particular startled the 
scholarly community; one was by Dixon and Kroeber, the other by Sapir. 
Dixon and Kroeber reported the results of their survey of California fami- 
lies, reducing Powell's 22 families to nine. The principal reductions resulted 
from the establishment of a Penutian family (combining Miwok-Costanoan, 
Maiduan, Yokuts, and Wintun) and a Hokan family, which was an ex- 
panded version of the Shasta-Palaihnihan-Chimariko group adumbrated by 
Dixon some years before. Karok, Pomo, Yanan, Esselen, and Yuman were 
now added to the earlier group. In addition to the newly discovered Penu- 
tian and Hokan stocks, Chumash and Salinan were joined in an Iskoman 
family; Wiyot and Yurok, whose similarities Kroeber had perceived in 1911, 
were set up as a Ritwan family. This substantial consolidation reduced the 
number of California families to Penutian, Hokan, Iskoman, Ritwan, Atha- 
baskan, Klamath, Yuki, Washo, and Numic. 

Like Boas, Edward Sapir was born in Germany, but he was brought to 
the United States at the age of five. After receiving an M.A. in Germanic 
languages, he became interested in American Indian languages through 
contact with Boas. His curiosity led to fieldwork with the Indians of Wash- 
ington, Oregon, and California, and a doctorate in linguistics from Colum- 
bia in 1909. Sapir had spent the academic year 1907—8 in Berkeley, where, 
according to Campbell and Mithun (1979: 26), he had been "caught up in 
the reductionist zeal of Kroeber and Dixon which prevailed at the time." In 
1913, the same year that Kroeber and Dixon announced their sweeping re- 
organization of California families, Sapir made the startling discovery that 
Wiyot and Yurok (joined by Dixon and Kroeber in a Ritwan group) are dis- 
placed Algonquian languages, or at the very least more closely related to 
Algonquian than to any other language. Since the Algonquian family is 
found predominantly east of the Mississippi River, and the nearest Algon- 
quian language was 600 miles (1,000 km) away, an Algonquian affinity for 
Wiyot and Yurok was totally unexpected. Moreover, although Wiyot and 
Yurok were neighboring languages on the northern California coast, it is by 
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no means clear that they are more closely related to each other than either is 
to Algonquian. Thus Dixon апа Kroeber's Ritwan might not be a valid ge- 
netic node after all, though Sapir left the possibility open. 

An irate Truman Michelson, the leading Algonquianist of the day, issued 
a stinging rebuttal to Sapir's claims the following year, offering five counter- 
arguments: (1) Wiyot and Yurok have un-Algonquian traits, (2) Sapir's mor- 
phological resemblances were "purely fanciful," (3) Sapir's alleged pronomi- 
nal similarities must be accidental, since they are found in other Indian 
languages as well, (4) Wiyot and Yurok possess certain non-Algonquian 
morphological elements, (5) the cumulative weight of arguments 1 through 
4! Michelson (1914: 367) concluded triumphantly that "enough has been 
said to show the utter folly of haphazard comparisons." 

A clearly exasperated Sapir responded to Michelson's criticism the fol- 
lowing year. First, Sapir rebuked Michelson for concentrating on the un- 
Algonquian and non-Algonquian features of Wiyot and Yurok (what I have 
called the fallacy of negative evidence in section 4.1) and failing to pay at- 
tention to the numerous threads connecting Wiyot and Yurok with Algon- 
quian: ^the really convincing positive points raised in my paper are sedu- 
lously avoided" (Sapir 1915: 189). Second, Sapir pointed out that raising 
questions about a few etymologies that may well be invalid does not under- 
mine the value of the rest of the etymologies that still massively support the 
connection. Regarding Michelson's failure to appreciate the evidential value 
of some 200 etymologies, including pronominal similarities of a most inti- 
mate nature, Sapir wrote: "There is no accounting for tastes, but it seems to 
me that such callousness deserves to be called linguistic cynicism almost as 
much as scientific sobriety" (pp. 189—90). 

Michelson added yet another rejoinder, to which Sapir replied briefly, 
closing the debate. Needless to say, neither succeeded in convincing the 
other. Inevitably other scholars took sides, with Boas joining Michelson, 
and Dixon and Kroeber supporting Sapir. It is often claimed that this "con- 
troversy was not settled until 1958, when Haas presented irrefutable evi- 
dence of the Algonquian-Ritwan relationship" (Hoijer 1976: 15). In point of 
fact Sapir had himself offered irrefutable evidence in his original 1913 ar- 
ticle. Although it is true that this proof was not universally accepted until 
the 1950's, the scholarly acceptance of proof is an altogether different ques- 
tion from that of proof itself, as the history of science has repeatedly shown. 

With the Algonquian-Ritwan debate in full swing, Sapir announced yet 
another consolidation, joining Haida, Tlingit, and the Athabaskan family in 
a stock to which he gave the name Na-Dene. This too provoked a storm of 
controversy, with Pliny Goddard the principal critic (see section 6.2). Other 
consolidations were made with increasing rapidity, including the extension 
of both the Hokan and Penutian families to include languages outside Cali- 
fornia. Hokan was extended to Mexico by Kroeber in 1915 with the addi- 


6.3 Атегіпа 213 


tion of бегі and Teguistlatecan, апа Бу Walter Lehmann іп 1920 with the 
inclusion of Tlapanecan. Іп 1917 Sapir added to Hokan both the South 
Texas Coahuiltecan family (established by John Swanton in 1915) апа the 
Iskoman family of California that had been left ungrouped by Dixon and 
Kroeber (1913). During the 1920's Sapir extended Penutian northward to in- 
clude languages from Oregon, Washington, and British Columbia, and in 
1929 he added a Mexican branch to Penutian comprising Mixe-Zoque and 
Huave. 

During this same period scholars were also at work consolidating Powells 
groups outside California. In 1910 John Harrington presented evidence 
linking Kiowa with Tanoan and suggested that this expanded Tanoan family 
was related to Uto-Aztecan, a position subsequently adopted by Sapir (1929) 
and supported by Whorf and Trager (1937). In the Southeast Swanton 
joined Tunica, Chitimacha, and Atakapa in 1919, апа Natchez and Musko- 
gean in 1924. Leo Frachtenberg (1920) revived the Mosan group (= Salish, 
Wakashan, and Chimakuan), which had been suggested (but then aban- 
doned) by Boas at the close of the nineteenth century. In 1921 Sapir sug- 
gested an affinity between Yuchi, a language isolate, and the Siouan family. 
A few residual problems in California were also cleared up, with Dixon and 
Kroeber assigning Washo to Hokan in 1919 and Sapir adding Klamath to 
Penutian in 1929. 

Im 1929 Sapir published a classification of North and Central American 
languages that, as it turned out, both summarized and climaxed the reduc- 
tionist phase of Amerindian classification. This classification, which dif- 
fered in only a few details from опе he proposed in Sapir (1921a), divided 
the languages north of Mexico into six stocks and those of Central America 
into 15. Furthermore, Sapir asserted that “Middle America, in spite of its 
special cultural position, is distinctly a part of the whole North American 
linguistic complex and is connected with North America by innumerable 
threads” (1929: 141). The Sapir classification is shown in Lists 6.5 and 6.6. 
List 6.6 excludes certain groups that Sapir had classified as branches of 
more northerly families (e.g. Tlapanecan, Tequistlatecan, Aztecan), though 
Mixe-Zoque-Huave appears іп both lists. Sapir considered his classification 
"suggestive but not demonstrable in all its features at the present time" 
(p. 139), and he foresaw the likelihood of at least some of the Central 
American families being linked to stocks farther north. 

The 1930's and 1940's were, with a few exceptions, uneventful for Amer- 
indian classification. Freeland and Sapir (1930) added support to the con- 
nection between Mixe-Zoque and (California) Penutian, and Whorf and 
Trager (1937) offered evidence for Aztec-Tanoan such that the connection 
became generally accepted. During the 1930's Whorf is reported to have 
proposed a Macro-Penutian stock consisting of Penutian, Aztec-Tanoan, 
Zuni, Mayan, Totonacan, Tunica, Xinca, and Lenca, but the evidence was 





List 6.5. North American Families (Sapir 1929) 


ESKIMO-ALEUT 


ALGONQUIAN-MOSAN: 
I ALGONQUIAN-RITWAN: 
А ALGONQUIAN 
?B Beothuk 
C RITWAN: 
1 Wiyot 
2 Yurok 
II Kutenai 
III MOSAN: 
A WAKASHAN 
B CHIMAKUAN 
C SALISH 


NA-DENE: 
I Haida 
П ATHABASKAN-TLINGIT: 
A Tlingit 
B ATHABASKAN 
PENUTIAN: 
I CALIFORNIA: 
A MIWOK-COSTANOAN 
B YOKUTS 
C MAIDUAN 
D WINTUN 
П OREGON: 
А Takelma 
B COAST: Coos, Siuslaw, 
Yakonan 
C Kalapuya 
III Chinook 
IV Tsimshian 
V PLATEAU: 
A Sahaptin 
B MOLALE-CAYUSE 
C Klamath 
VI MEXICAN: 
А Huave 
B MIXE-ZOQUE 


AZTEC-TANOAN: 

I UTO-AZTECAN: 
A AZTECAN 
B PIMIC 
C NUMIC 

II KIOWA-TANOAN: 
А Kiowa 
B TANOAN 

?Ш Zuni 





HOKAN-SIOUAN: 
I HOKAN-COAHUILTECAN: 
A HOKAN: 
1 NORTHERN: 
a KAROK-PALAIH- 
NIHAN: 
i Karok 
ii Chimariko 
iii SHASTA- 
PALAIHNIHAN 
b YANAN 
c POMO 
2 Washo 
3 ESSELEN-YUMAN: 
a Esselen 
b YUMAN 
4 SALINAN-SERI: 
a Salinan 
b Chumash 
c Seri 
5 TEQUISTLATECAN 
B TLAPANECAN 
C COAHUILTECAN: 
1 Tonkawa 
2 COAHUILTECAN PROPER: 
a Coahuilteco 
b Cotoname 
c Comecrudo 
3 Karankawa 


II Yuki 
III Keres 
IV TUNICA-CHITIMACHA: 
A TUNICA-ATAKAPA 
B Chitimacha 
V IROQUOIAN-CADDOAN: 
A IROQUOIAN 
B CADDOAN 
VI EASTERN: 
А SIOUAN-YUCHI: 
1 Yuchi 
2 SIOUAN 
B NATCHEZ-MUSKOGEAN: 
1 Natchez 
2 MUSKOGEAN 
?3 Timucua 
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List 6.6. Central American Families (баріт 1929) 


Chinantec MISKITO-SUMO-MATAGALPA Paya 
TJanambre MIXE-ZOQUE-HUAVE Tarascan 
Jicaque MIXTEC-ZAPOTEC TOTONACAN 
Lenca TOlive TWaicuri 





MAYAN OTOMIAN-MANGUE Xinca 


never published. Less dramatic but still controversial was McQuown's (1942) 
proposal of a Mexican branch of Penutian combining Mayan and Totonacan 
with Mixe-Zoque. 

These few contributions aside, interest in classificatory problems was 
dormant and did not reawaken until the 1950's. This dormant period is 
sometimes blamed on Sapir's “premature” 1929 classification, which sup- 
posedly stifled further interest. It seems more likely that this waning of in- 
terest in taxonomy reflects the changing zeitgeist of the 193065, when meth- 
odological issues increasingly supplanted descriptive and classificatory 
problems as the focus of scholarly attention. Furthermore, the structure of 
language was increasingly investigated from a synchronic (ahistorical), 
rather than a diachronic (historical), perspective, and this new orientation 
also diminished interest in classification. Finally, as we have just seen, 
problems of classification had been thrashed out in minute detail over a pe- 
riod of 20 years (1910-30). Eventually such arguments become repetitive, 
and the quarrels tiresome even to the participants; their exhaustion seems 
to have been at least a contributing factor in the almost total cessation of 
taxonomic research. 

In whatever field classification is carried out, scholars tend to divide 
themselves into two categories. At one pole there are the “lumpers,” who 
manifest a predilection for combining entities into more comprehensive 
groups. At the other are the "splitters," who tend to divide groups into a 
greater number of distinct entities. The second phase of Amerindian classi- 
fication was clearly dominated by the lumpers, with Dixon, Kroeber, and 
Sapir leading the way. During the third phase of Amerindian classification, 
running from roughly 1950 to the present, a reaction set in against what 
were perceived as Sapir's excesses, and splitters have dominated the scene 
now for three decades. Mary Haas, a student of Sapir's and probably the 
leading Amerindian scholar of the second half of the twentieth century, has 
been a leading advocate of a more conservative approach to genetic classifi- 
cation than Sapir's. In addition to doing fieldwork throughout much of 
North America, Haas directed the survey of California Indian languages at 
the University of California (Berkeley) that has produced a wealth of new 
linguistic material on California languages. She has also been responsible 
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for the training of a good number of those scholars currently engaged in 
Amerindian research. 

As Sapir himself candidly admitted, evidence had not been published 
for all the groupings in his 1929 classification. When scholars returned to 
classificatory problems in the 1950's, in part encouraged to do so by the new 
materials becoming available, they adopted a relatively conservative stance 
that accepted only those portions of Sapir's classification for which evidence 
had been adduced. Their view of the Sapir classification was, therefore, 
roughly as follows. Eskimo-Aleut and Na-Dene were accepted unchanged, 
though it was sometimes claimed, despite Sapir's work, that "the Nadene 
super-stock, which unites Athapaskan, Haida, and Tlingit, is still unsup- 
ported" (Hoijer 1976: 15). Aztec-Tanoan was accepted, except for the inclu- 
sion of Zuni, which was reinterpreted as a language isolate. Penutian con- 
tinued to be considered a valid genetic group, though the inclusion of the 
Mexican branch remained in dispute. Algonquian-Mosan was broken up 
into four independent families (Algonquian-Ritwan, Salish, Wakashan, 
Chimakuan) and two language isolates (Beothuk and Kutenai). 

The grouping that came in for the most criticism was Hokan-Siouan, 
which was reputedly Sapir’s “wastepaper basket" stock, into which he 
lumped everything else once the relatively clear-cut families had been fac- 
tored out. Hokan-Siouan was broken up into six families (Hokan, Tunica- 
Chitimacha, Iroquoian, Caddoan, Siouan, Natchez-Muskogean) and four 
isolates (Yuki, Keres, Yuchi, Timucua). 

The opening salvo of the third phase of research on Amerindian classifi- 
cation was Haas's (1951) proposal of a Gulf group, combining Sapir's Tunica- 
Chitimacha and Boas's Natchez-Muskogean. In 1953 Morris Swadesh, like 
Haas a student of Sapir's, published evidence supporting the Mosan (Sa- 
lish, Wakashan, Chimakuan) stock, the existence of which had been sug- 
gested by Boas in 1894 and Frachtenberg in 1920. That same year Swadesh 
published a paper with Joseph Greenberg claiming that the Jicaque lan- 
guage of Honduras belonged to the Hokan phylum. In 1958 Haas demon- 
strated the existence of sound correspondences between Algonquian and 
Gulf, thus for the first time directly linking elements of two of Sapir's six 
stocks. Furthermore, if Algonquian and Gulf form a valid genetic group, 
then Sapir's Algonquian-Mosan does not. That same year Haas also discov- 
ered sound correspondences connecting Wiyot and Yurok with Algon- 
quian, effectively ending the Sapir-Michelson debate. 

In the early 1960's a need was felt to reassess the status of Amerindian 
classification, and toward that end a conference, attended by 30 specialists, 
was held in 1964. That conference reached a "consensus" classification, 
summarizing “Phase Ш” work to that point. This consensus classification, 
discussed in Voegelin and Voegelin (1967), is illustrated in List 6.7. 

Although there was little activity in Amerindian classification following 
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List 6.7. North American Families (Voegelin & Voegelin 1967) 


CHUKCHI-ESKIMO: 
I ESKIMO-ALEUT 
II CHUKCHI-KAMCHATKAN 


NA-DENE: 
I Haida 
II Tlingit 
III ATHABASKAN (including Eyak) 
MACRO-ALGONQUIAN: 
I ALGONQUIAN-RITWAN: 
A ALGONQUIAN 
B Wiyot 
C Yurok 
II GULF: 
А NATCHEZ-MUSKOGEAN 
B TUNICA-CHITIMACHA 
C Tonkawa 


MACRO-SIOUAN: 
I IROQUOIAN 

II SIOUAN 

III Catawba 

IV CADDOAN 

V Yuchi 


HOKAN: 
I Shasta 
II PALAIHNIHAN 
Ш POMO 
IV YUMAN-SERI 
V TEOUISTLATECAN 
VI TLAPANECAN 
VII ҒҮАМАМ 
ҮШ Salinan 
ІХ TChumash 
X TComecrudo 
XI TChimariko 
ХП Washo 
XIII Karok 
XIV COAHUILTECAN 
XV TEsselen 
XVI Jicaque 


PENUTIAN: 
I YOKUTS 
П MAIDUAN 
III WINTUN 
IV MIWOK-COSTANOAN 
V SAHAPTIN-NEZ PERCE 
VI TCayuse 
VII tMolale 
VIII Coos 
IX YAKONAN 
X KALAPUYA 
XI Takelma 
XII Chinook 
XIII Tsimshian 
XIV Zuni 
XV MIXE-ZOQUE 
XVI MAYAN 
XVII CHIPAYA-URU 
XVIII TOTONACAN 
XIX Huave 


AZTEC-TANOAN: 
I TANOAN 
П UTO-AZTECAN 


OTO-MANGUEAN 
Keres 


YUKI-WAPPO 
TBeothuk 
Kutenai 
TKarankawa 


CHIMAKUAN 
SALISH 


WAKASHAN 
TTimucua 
Tarascan 
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List 6.8. North American Families (Campbell & Mithun 1979) 
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ESKIMO-ALEUT Washo TMolale 
NA-DENE TSalinan Coos 
Haida Karok tAlsea 
ALGONQUIAN- TChumash Siuslaw 
RITWAN TCotoname TTakelma 
MUSKOGEAN tComecrudo Kalapuya 
TNatchez TCoahuilteco Chinook 
tAtakapa tAranama Tsimshian 
Chitimacha tSolano Zuni 
tTunica tEsselen TANOAN 
Tonkawa Jicaque UTO-AZTECAN 
SIOUAN TLAPANECAN Keres 
IROQUOIAN TEQUISTLATECAN YUKI-WAPPO 
CADDOAN YOKUTS TBeothuk 
Yuchi MAIDUAN Kutenai 
YUMAN WINTUN TKarankawa 
Seri MIWOK CHIMAKUAN 
POMO COSTANOAN SALISH 
PALAIHNIHAN Klamath WAKASHAN 
Shasta Sahaptin TTimucua 
TYANAN tCayuse tAdai 
tChimariko 





the 1964 conference, and certainly no major discoveries, a second confer- 
ence on the classification of New World languages was held in 1976, with 
the results published in Campbell and Mithun (1979). This second “consen- 
sus classification,” shown in List 6.8, increased the number of independent 
genetic units in North America (including some, but not all, of the indige- 
nous languages of Mexico) to a whopping 63, dismembering almost every 
genetic group that had previously been postulated, including not only the 
extended versions of Hokan and Penutian, but even their California nuclei, 
which had been the cornerstones of Phase II research. According to Camp- 
bell and Mithun (1979: 39), this "classification . . . is conservative and not 
very controversial, but rather represents something of an encapsulation of 
current received opinion.” 

It seems rather doubtful that all the participants in this conference would 
truly subscribe to every aspect of Campbell and Mithun’s classification. In 
particular, it seems difficult to believe that Haas could accept Campbell and 
Mithun's (1979: 43) assertion that "as yet there has been no sufficient dem- 
onstration that any two of the variously proposed branches [of Hokan] are 
actually related genetically," when only a decade earlier she wrote that 
Sapir's (1917) Hokan cognates "stand up remarkably well in the light of 


6.3 Amerind 219 


more recent knowledge, and the paper therefore ought to have been ade- 
quate to establish the validity of the Hokan stock” (Haas 1964: 74). 

For Central America Campbell (1979) contributes an equally conser- 
vative classification. Omitting groups already listed for North America (e.g. 
Uto-Aztecan, Tlapanecan, Tequistlatecan, Seri), he identifies the Central 
American groups shown in List 6.9. 

In contrast to the substantial activity on North and Central American 
Indian languages during this century, South America has been largely ig- 
nored. Those few scholars who did conduct taxonomic research on South 
American languages generally contented themselves with a simple listing 
of numerous diverse groups, and no major consolidations of the kind that 
took place in North America were proposed until Greenberg's address іп 
1956 and subsequent work. The initial classification of South American Іп- 
dian languages by Brinton (1891) was followed by similar attempts by Alex- 
ander F. Chamberlain (1903, 57 families; 1913, 83 families), Paul Rivet (1924, 
77 families), and Cestmír Loukotka (1935, 94 families; 1944, 114 families; 
1968, 117 families). Most of the linguistic research on Central American and 
especially South American Indian languages since 1950 has been the work 
of the Summer Institute of Linguistics (SIL), which began work in Central 
America in 1935 and in South America in 1946. Although SIL linguists have 
produced a wealth of new descriptive material and many low-level genetic 
classifications, they have not attempted an overall classification of American 
Indian languages. 

Returning to North America, what are we to make of the apparent re- 
gression in Amerindian classification to a position little different from 
views prevailing at the start of this century? Can it be that Dixon, Kroeber, 
and Sapir, who were thought to have made seminal discoveries in Amerin- 
dian classification, had merely deluded themselves? Is it possible that Sapir, 
so often described as a genius in questions of taxonomy, had in fact over- 
stepped the bounds of reasonable genetic classification to the extent implied 
in Lists 6.7 and 6.8? This apparently is what “current received opinion” 
would have us believe. Were Sapir alive today to witness the dismember- 
ment of much of his life's work, I suspect he would feel that “such cal- 
lousness deserves to be called linguistic cynicism almost as much as scien- 


List 6.9. Central American Families (Campbell 1979) 








OTO-MANGUEAN Tarascan 
Jicaque Cuitlatec 
Huave Xinca 
MIXE-ZOQUE Lenca 
MAYAN Paya 
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tific sobriety.” Sapir, however, died in 1939, so we cannot turnto him fora 
solution to our dilemma. 

Fortunately for Amerindian classification there was one scholar working 
outside of the normal “Phase Ш” paradigm. This was the American linguist 
Joseph Greenberg, whose contributions to African classification (see Chap. 
3) and Oceanic classification (see section 5.2) have already been examined. 
When Greenberg had completed his classification of African languages in 
the mid-1950's, he began to investigate the two remaining areas of the 
world, New Guinea and South America, where genetic classification had 
not progressed beyond the simple enumeration of numerous small groups. 
Using multilateral comparison (see section 3.6), Greenberg quickly realized 
that South American Indian languages were connected to those of Central 
and North America "by innumerable threads," as Sapir (1929) had de- 
scribed the cohesiveness of North and Central America. Greenberg re- 
ported his preliminary findings in 1956 at the Fifth International Congress 


List 6.10. New World Families (Greenberg 1960) 


ESKIMO-ALEUT 
NA-DENE 


AMERIND: 
I NORTHERN BRANCHES (e.g. ALGONQUIAN, 
SALISH, SIOUAN, IROQUOIAN) 
II HOKAN 
III PENUTIAN (including MIXE-ZOQUE, Huave, 
MAYAN, and TOTONACAN) 

IV А?ТЕС-ТАМОАМ 

V OTO-MANGUEAN 

VI Tarascan 

VII MACRO-CHIBCHAN: 
A CHIBCHAN 
B PAEZAN 
VHI ANDEAN-EQUATORIAL: 
A ANDEAN 
B JIVAROAN 
C MACRO-TUCANOAN 
D EQUATORIAL (including ARAWAKAN and 
TUPI) 

IX GE-PANO-CARIB: 
A MACRO-GE 
B MACRO-PANOAN 
C Nambikuara 
D Huarpe 
E MACRO-CARIB 
F Taruma 
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of Anthropological and Ethnological Sciences; а brief summary of the re- 
sults (see List 6.10) was published in the Proceedings of the Congress in 
1960, though without supporting evidence. 

Greenberg's startling claim that all the languages of North, Central, and 
South America could be assigned to only three genetic families (Eskimo- 
Aleut, Na-Dene, Amerind) has been generally ignored by Amerindian spe- 
cialists. It is not even mentioned in Campbell and Mithunss (1979) survey of 
Amerindian classification. Over the past quarter-century, with concen- 
trated attention since 1970, Greenberg has compiled comparative wordlists 
of New World languages. An estimated quarter-million entries have been 
collected in 20-odd notebooks (copies of which may be found in Stanford 
University's Green Library), drawing together information on New World 
languages from roughly 3,000 different sources. This is the largest and 
most detailed compilation of New World vocabularies ever assembled. 

Greenberg has previously revealed portions of the evidence for his tri- 
partite classification (1979, 1981), but it is only in his most recent work 
(1987) that the extraordinary breadth and depth of the evidence becomes 
apparent. In support of the Amerind phylum Greenberg offers over 300 
etymologies (involving basic vocabulary and pronouns, the elements of 
language least susceptible to borrowing) connecting different Amerind 
groups from Canada to the tip of South America. The nature and distri- 
bution of the evidence are such that the existence of the Amerind phylum 
may be considered established. Furthermore, evidence has recently been 
Offered from dentition and gene distributions in the New World that cor- 
roborates Greenberg’s tripartite classification of the peoples in the Western 
Hemisphere, although he of course based his division solely on linguistic 
evidence. The convergence of these three independent lines of inquiry is 
discussed in Greenberg, Turner, and Zegura (1985). 

As revolutionary as Greenberg’s proposals are, they were foreshadowed 
in the earlier literature in a number of interesting ways. As I have men- 
tioned, some nineteenth-century linguists asserted that all New World lan- 
guages were genetically related in a single family, the defining characteris- 
tics of which were essentially typological (i.e. the so-called polysynthetic 
structure of Amerindian languages). These early claims were not, however, 
supported by evidence and must be considered to reflect a certain parti pris 
of the early investigators. Furthermore, Boas finally put an end to the long- 
lived myth that all Amerindian languages are polysynthetic, thus under- 
mining what had been taken to be their genetic basis. Finally, nowhere in 
the nineteenth-century literature is there any awareness of the individuality 
of the Na-Dene languages with respect to other Amerindian groups; Atha- 
baskan (later Na-Dene) was simply regarded as one group among many. 
For all these reasons I do not believe there were any true precursors to the 
Greenberg classification in the nineteenth century. 
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The first intimation of the Amerind phylum may be traced to the Sapir- 
Michelson controversy (discussed above), though neither participant was 
aware at the time of the import of their dispute over Amerindian pronouns. 
Part of the evidence that Sapir had offered in support of Algonquian- 
Ritwan was the presence of a first-person pronoun (T) with n- as the char- 
acteristic consonant and a second-person pronoun (‘you’) characterized by 
m-. Michelson responded that this same distribution of n- and m- was found 
in a fair number of North American languages in addition to Algonquian, 
Wiyot, and Yurok. To this assertion Sapir replied, lamely, that Michelson 
was confusing a few sporadic resemblances with a set of significant cog- 
nates. In reality, however, Michelson was citing evidence that the Algon- 
quian-Ritwan family was merely one component of an even larger genetic 
group, and it is this larger grouping, not just Algonquian-Ritwan, that is 
characterized by n- and m-, respectively, in the first- and second-person 
pronouns. Interpreted in this manner, Michelson's critique in no way affects 
Sapir's claim of genetic affinity between Algonquian, Wiyot, and Yurok. 

Several years later, when reviewing a grammar of a Bolivian Indian lan- 
guage, Sapir was struck by the presence of a second-person pronoun in m-, 
remarking (1917a: 184) that "the curiously widespread American second 
singular m- meets us here once more (mi ‘thou’).” A year later he discovered 
that not only was second-person т- found in South America, but so too 
was first-person n-. At this point Sapir realized that there must be an enor- 
mous family of languages stretching from North America through Central 
America and deep into South America, for he wrote in a letter dated Au- 
gust 1, 1918: "Getting down to brass tacks, how in the Hell are you going to 
explain general American n- T except genetically? It's disturbing, I know, 
but (more) non-committal conservatism is only dodging, after all, isn't it? 
Great simplifications are in store for us. . . . Itseems to me that only now is 
American linguistics becoming really interesting, at least in its ethnological 
bearings" (quoted in Darnell & Sherzer 1971: 27). 

Although Sapir did not pursue this line of inquiry in his later work, I 
believe this early recognition of two distinguishing traits of the Amerind 
phylum clearly foreshadows Greenberg's conclusions some four decades 
later. Interestingly, one critic of Sapir's work in genetic classification, his 
former teacher Franz Boas, was also aware of the embarrassingly wide dis- 
tribution of certain Amerindian pronouns, suggesting in 1917 that "the fre- 
quent occurrence of similar sounds for expressing related ideas (like the 
personal pronouns) may be due to obscure psychological causes rather than 
to genetic relationship" (quoted in Haas 1966a: 102). 

Not only was Sapir aware of the apparent existence of an enormously 
widespread American stock by 1918, but even earlier he had remarked on 
the fact that Eskimo-Aleut and Na-Dene seemed to stand apart from other 
New World languages. In his 1916 study he suggested that "the peopling of 


6.3 Amerind 223 


America was not а single historical process, but а series of movements of 
linguistically unrelated peoples, possibly from different directions and cer- 
tainly at very different times. This view strikes me as intrinsically highly 
probable. As the latest linguistic arrivals in North America would probably 
have to be considered the Eskimo-Aleut and the Na-Dene (Haida, Tlingit, 
апа Athabaskan)” (рр. 454—55). 

Other scholars, over the years, have perceived at least part of the Amer- 
ind phylum. Benjamin Whorf (1935: 608) proposed a Macro-Penutian phy- 
lum incorporating Penutian, Aztec-Tanoan, Zuni, Mayan, Totonacan, and 
possibly Tunica. Since Whorf published no evidence for his assertion, we 
cannot be sure whether he had recognized a large portion of the Amerind 
family, but it is not unlikely. Іп the mid-1950's Morris Swadesh (1954, 1956) 
postulated a Penutoid stock containing languages of North, Central, and 
South America. In support of Penutoid, Swadesh (1954: 309—11) offered 13 
Penutoid etymologies and two pronouns: ni/na 'l' and mi/ma ‘you,’ the first 
of which has as common variants ña and, especially in South America, ja. 
In 1960 he wrote that "recent research seems to show that the great bulk 
of American languages form a single genetic phylum going far back in 
time. . . . Eskimo-Aleutian and Nadenean seem to stand apart, and may 
therefore represent later waves of migration" (p. 896). In his later work 
Swadesh considerably revised his classification of New World languages, in 
line with his growing belief that all the world's languages were related in a 
vast network of connections. However, he provided little evidence for his 
sweeping revisions, which were based primarily on lexicostatistics. Al- 
though Swadesh's later taxonomic work diverged significantly from his ear- 
lier views, his 1954 and 1960 papers, at least, seem to foreshadow the 
Amerind phylum and its distinctness from both Na-Dene and Eskimo- 
Aleut. Recently both Matteson (1972) and Key (1981a,b,c, 1983) have pre- 
sented evidence that is consonant with Greenberg's Amerind phylum. 

In addition to these possible precursors of Greenberg's Amerind stock, 
one encounters not infrequently in the literature remarks that all but admit 
the existence of an enormous Amerindian family: "It must be kept in mind 
that most of the specialists who have immersed themselves in the study of 
large numbers of American Indian languages believe that almost all of them 
are genetically related to one another" (Kaufman 1974: 960). 

Yet another factor should have led to the recognition of broad genetic 
connections in the New World, but in fact did not. Since the nineteenth 
century, and with increasing frequency in recent decades, numerous schol- 
ars have observed striking similarities between supposedly "independent" 
genetic groups. In the bibliography at the end of this section I have in- 
cluded (under Amerind) a representative sample of various proposals link- 
ing two or more Amerind groups. The groups they connect are indicated in 
brackets following each entry. Connections of this sort have been so numer- 
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ous and widespread that it has seemed at times that the comparison of any 
two Amerind groups produced a list of tantalizing similarities. Phase III lin- 
guists reacted to these seemingly pervasive interconnecting genetic links 
by dismissing them all as random "noise," much in the spirit of Michelson 
(1914). For example, Goddard (1979: 376), in discounting Sapir's proposed 
Hokan affiliation for the Coahuiltecan group (which he also discounts), re- 
marks "how easy it is to find lexical resemblances between groupings of 
large numbers of poorly recorded languages." 

It is time we face the fundamental question: how can we reconcile 
Greenberg’s reduction of New World phyla to just three (Eskimo-Aleut, Na- 
Dene, Amerind), and the compelling evidence behind it, with the prevail- 
ing "consensus" view (summarizing Phase III research) that the New World 
contains dozens, if not hundreds, of independent families? Why was a ge- 
netic unit that was sensed by Sapir (and perhaps others), and proved by 
Greenberg, invisible to scholars working within the Phase III paradigm? 
How are we to explain the failure of Phase III scholars to extend and con- 
solidate the splendid foundation that had been bequeathed them by Dixon, 
Kroeber, and Sapir? And what was it that eventually led them to try to de- 
stroy the foundation itself? 

In fact, a whole complex of factors has contributed to the failure of the 
Phase III paradigm. One of the primary factors was that Phase III scholars 
adopted different standards of proof of genetic affinity. In the classifica- 
tions of Gallatin, Brinton, Powell, Kroeber, and Sapir, lexical and gramma- 
tical resemblances, if sufficient in number and plausible both phonologi- 
cally and semantically, were the cornerstone of proof of genetic relationship. 
Their methods are often referred to disparagingly by Phase III scholars as 
the "inspection of crude word lists," the "'laundry-lis approach," or 
"simple lexical scanning" (Campbell & Mithun 1979: 10, 50, 53). In lieu of a 
set of shared cognates, Phase III scholars have relied on either the establish- 
ment of sound correspondences or the reconstruction of the parent stock as 
the ultimate proof of genetic affinity. 

The changing standard of proof reflects, I believe, the changing academic 
climate from the 1930's on. As I have noted, during the 1930's methodologi- 
cal issues became paramount. When taxonomic questions were again 
broached in the 1950’s, scholars felt it necessary to develop a more rigorous 
theoretical foundation than the impressionistic cognate sets of earlier schol- 
ars. For this reason sound laws (sometimes referred to as "phonetic equa- 
tions" to emphasize their supposed mathematical precision) and reconstruc- 
tion were substituted for the earlier criterion of cognation. The deleterious 
effects of this decision were discussed in section 3.6. The fact is, the new 
standard of proof not only was incapable of extending Phase II research, it 
was even unable to substantiate Phase II results and consequently led di- 
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rectly to the sense of disillusionment апа frustration that is apparent іп 
Campbell and Mithun (1979), Shipley (1980), and elsewhere. 

Equally disastrous for Amerindian classification was the rise in popu- 
larity of the binary approach to taxonomy, which involves the comparison 
of two languages (or two groups), usually chosen for comparison quite by 
accident, in order to establish a genetic connection between the two. As 
Lamb (1964) pointed out, practitioners of the binary approach fail to distin- 
guish language classification from the establishment of genetic relationship. 
Although the binary approach may suggest а genetic relationship, it tells us 
next to nothing about classification. According to Lamb (1964: 109), "The 
question that should be of greatest concern for language classification . . . 
is not whether or not a given pair of languages is related, but how close or 
distant the relationship is." 

To illustrate the deficiencies of the binary approach, let us consider a 
specific example. In 1964—65 Roland Olson sought to show that the Chipaya 
language of Bolivia is related to the Mayan languages of Central America. 
Even if one grants that a genetic relationship between the two is indicated 
by his cognate sets, there is no evidence that Chipaya and Mayan are more 
closely related to each other than either is to other languages. In light of 
Greenberg's findings it is now clear that Chipaya and Mayan are related (as 
members of Amerind), but they have no special relationship to each other. 
Mayan remains a member of Penutian, and Chipaya belongs to the Ara- 
wakan branch of Equatorial. The only reason Mayan was even compared 
with Chipaya is that Olson happened to have first-hand experience with 
both groups. Comparing Mayan with Chipaya is like comparing Irish and 
Bengali to prove they are related. Although such a proof might be possible, 
it would be tortured and incomplete, for the simple reason that much of the 
evidence demonstrating the affinity of Irish and Bengali (or Chipaya and 
Mayan) is contained in neither language. For amplification of this point see 
section 3.5. The failure to understand this fundamental principle of genetic 
classification has unfortunately led to a great deal of wasted time and en- 
ergy. We may hope that the day when arbitrary pairs of languages are com- 
pared under the guise of genetic classification is now over. 

The binary approach and the insistence on regular sound correspon- 
dences and/or reconstruction are probably the most serious defects of 
Phase Ш research, but at least a half-dozen others warrant brief mention: 

1. Silverstein (1979: 651) observes that "there has been for a long time a 
peculiar development of comparative-historical Amerindian linguistics. . 
This trend has sought doctrinally to work ‘from the bottom up’ and from 
present to past unidirectionally." Ignoring the lessons of comparative Indo- 
European linguistics, many Americanists have come to believe that to reach 
greater time depths and more comprehensive classifications one must first 
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reconstruct the proto-languages for all the low-level genetic groups. Only 
after these proto-languages (e.g. Proto-Penutian, Proto-Hokan, Proto- 
Algonquian) have been established is it legitimate to compare them, in 
turn, to see if greater consolidations are possible (i.e. Amerind). Except in 
Phase III Amerindian linguistics, it has been firmly established since the 
nineteenth century that it is more fruitful to attack genetic classification si- 
multaneously from the bottom up and from the top down, since each ap- 
proach informs the other. The attempt to turn genetic classification into a 
kind of linguistic Rubik's cube constitutes a deviation from established 
principles of historical linguistics. 

2. As is often the case, political boundaries play a large role in determin- 
ing who studies which languages. American linguists tend to study lan- 
guages of the United States, perhaps keeping abreast of developments 
across the border in Mexico and Canada, but they are very unlikely to pay 
much attention to South America. Within the United States a similar com- 
partmentalization occurs between states. Yet from the perspective of ge- 
netic linguistics, these arbitrary political boundaries are meaningless, and 
taxonomic research that pays them any attention at all cannot but suffer as a 
result. In addition, of course, there is great disparity in the relative wealth 
of various countries, as well as very different attitudes toward the Native 
American population. Both factors have no doubt contributed to South 
America's relative isolation from the rest of Amerindian linguistics. 

3. Paralleling the arbitrary political boundaries are the equally arbitrary 
institutional and personal boundaries that have increasingly dictated that a 
scholar confine himself to one language, or at most one language family. As 
Langdon (1979: 597) candidly admits, "The number of scholars who have 
had direct contact with more than one branch of Hokan is extremely lim- 
ited." Key (1979: 17) reports that "in South America it is rare that an inves- 
tigator knows more than one language well." Obviously the extreme spe- 
cialization characteristic of contemporary academic studies makes the 
discovery of deep genetic relationships unlikely. 

4. The consensus classifications summarizing Phase III research fail to 
take into account that it is impossible for individual scholars of specific lan- 
guages and families to distinguish general Amerind traits from language- 
specific or family-specific ones. A Hokan specialist, for example, cannot 
distinguish the roots that Hokan shares with other Amerind groups from 
those that define Hokan itself unless he is familiar with all the potentially 
related Amerind groups. In fact, the term consensus classification is some- 
thing of a misnomer, implying as it does that a group of scholars have all 
examined a body of evidence and have reached similar conclusions based 
on it. In reality, each scholar has examined a different set of data (his particu- 
lar language or family), and the consensus refers to each scholar's willing- 
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ness to accept other scholars' work if they will accept his. The pitfalls and 
dangers of such "classification by committee" are too obvious to dwell on 
and represent once again a basic misunderstanding of the nature of genetic 
classification on the part of Phase III scholars. 

5. Phase III linguists tend to believe that combining groups that do not 
belong together constitutes a far more serious error than not combining 
groups that do. This bias is reflected most clearly in the second consensus 
classification, which is little more than an enumeration of obvious low-level 
genetic groups. In reality, as Lamb (1964: 107) observes, “It is just as bad to 
leave apart two related groups as it is to put together two unrelated ones." 

6. Many scholars believe it is impossible to classify a language unless 
one possesses complete, linguistically sophisticated materials. A short 
wordlist collected by a person with no linguistic training is therefore con- 
sidered of little or no value. In reality, as Kroeber, Sapir, and Greenberg 
realized full well, it is often possible to classify a language accurately on the 
basis of the most meager and poorly recorded materials if they happen to 
contain a few diagnostic cognates. As any historian knows, all information 
is imperfect in some respect, but “to throw out all evidence that does not 
attain to an arbitrary standard of fineness is sterile as well as unreasonable" 
(Dixon & Kroeber 1919: 113). 

It should be abundantly clear from the preceding discussion that genetic 
classification carried out within the Phase III paradigm stood no chance of 
making any significant progress. Operating with a methodological mythol- 
ogy largely of its own creation, Phase III represents not a continuation of 
the pioneering work of Phase II, but a blind alley that led nowhere. Rather it 
is the work of Greenberg that picks up where Sapir left off. In both meth- 
odology and results, Greenberg's classification carries the work of Kroeber 
and Sapir to its logical conclusion, confirming Sapir's suspicion that "great 
simplifications are in store for us." 

Table 6.2, overleaf, presents a chronological listing of the earliest identifi- 
cation of New World families. The list is neither complete (Algonquian and 
Siouan, among others, are lacking) nor in all probability totally accurate, 
and should be considered a first approximation. For a few groups two dates 
and names are given, indicating that the discovery was foreshadowed by an 
earlier scholar. 


6.3.2 Present Status of Amerind Classification 


In attempting to assess the present status of Amerindian classification, I 
find myself in something of a quandary, since I am confronted with two 
diametrically opposed views. Current received opinion posits dozens of 
genetically independent families in the New World; Greenberg (1987) pro- 





Table 6.2. Initial Identification of New World Families 





Year 


1635 
1645 
1768 
1782 
1797 
1800 
1816 
1818 
1840% 
1844 
1852 
1854 
1856 
1856 
1856 
1859 
1864 
1867 
1872 
1874 
1879 
1883 
1884 
1889 
1890 
1891 
1894 


1902 
1905 
1910 
1910 
1911 
1912 
1913 
1913 


1915 
1915 
1920 
1924 
1924 
1926 
1929 
1942 
1951 
1956 


1978 


1984 


Recognized by 


P.LeJeune 

А. Pérez Ribas 

D. Zeisberger 

S. F. Gilij 

B. Smith 

L. Hervás y Panduro 
J. C. Adelung, J. S. Vater 
R. Rask 

А. Gallatin 

J. Gregg 

W. Turner 

R. Latham 

R. Latham 

W. Turner 

W. Turner 

J. Buschmann 

M. Orozco y Berra 
Martius 

S. Powers 

F. Pimentel 

A. Gatschet 

H. de Charencey 
O. Stoll 

F. Boas 

M. Uhle 

D. Brinton 

F. Boas 


9. A. Lafone y Queredo 
F. Belmar 

J. Harrington 

W. Lehmann 

P. Rivet 

W. H. Mechling 

E. Sapir 

R. Dixon, А. Kroeber 


E. Sapir 

J. Swanton 

W. Lehmann 

P. Radin 

J. Swanton 

P. González Casanova 
E. Sapir 

N. McQuown 

M. Haas 

J. Greenberg 


J. Greenberg 


J. Greenberg 


Group (current name) 


Iroquoian (Northern branch only) 

Uto-Aztecan (Corachol, Nahuatl) 

Iroquoian (including Cherokee) 

Arawakan, Carib 

Iroquoian (including Cherokee and Tuscarora) 

Mayan (Yucatec, Quichean) 

Mayan (Yucatec, Quichean, Huastec) 

Eskimo-Aleut 

Numic 

Caddoan 

Athabaskan (Northern, Apachean) 

Ritwan (= Wiyot and Yurok) 

Miwok-Costanoan 

Athabaskan (Northern, Pacific Coast, Apachean) 

Uto-Aztecan (Numic, Takic), Yuman 

Uto-Aztecan (all branches except Aztecan) 

Oto-Manguean 

Ge, Macro-Ge 

Yuki-Wappo 

Mixe-Zoque 

Uto-Aztecan (including Aztecan), Tanoan 

Mixe-Zoque 

Mayan 

Wakashan 

Chibchan 

Hokan (Yuman, Seri, Tequistlatecan), Otomian 

Na-Dene (Athabaskan, Tlingit, Haida), Mosan (Salish, 
Chimakuan, Wakashan) 

Macro-Panoan (Tacana, Chimane) 

Mangean 

Tanoan (including Kiowa), Aztec-Tanoan 

Misumalpan 

Macro-Carib 

Chinantecan, Zapotecan 

Algic (- Algonquian, Wiyot, and Yurok) 

(California) Hokan, (California) Penutian, Ritwan 
(Wiyot, Yurok) 

Na-Dene (Athabaskan, Tlingit, Haida) 

Coahuiltecan 

Tlapanecan 

Mexican Penutian (Huave, Mixe-Zoque, Mayan) 

Natchez-Muskogean 

Popolocan 

Penutian (including Mixe-Zoque and Huave), Almosan 

(Mexican) Penutian (Mayan, Mixe-Zoque, Totonacan) 

Gulf 

Amerind, Chibchan-Paezan, Macro-Ge, Macro- 
Panoan, Macro-Carib, Andean, Eguatorial, 
Macro-Tucanoan 

Northern Amerind (Almosan-Keresiouan, Penutian, 
Hokan), Central Amerind (Uto-Aztecan, Tanoan, 
Oto-Manguean) 

Almosan-Keresiouan 
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vides compelling evidence that there are but three: Eskimo-Aleut, Na- 
Dene, and Amerind. More conservative linguists will no doubt feel that the 
traditional classification, listing dozens of families, should be retained until 
Greenberg's proposals have had a chance to be evaluated by other scholars. 
But since the inadequacies of more conservative classifications are so griev- 
ous, and since Greenberg's evidence is so overwhelming, I have had little 
difficulty in opting to follow Greenberg's minority opinion. I urge all who 
are interested in the classification of American Indian languages to review 
the evidence presented in Greenberg (1987) for themselves. In the re- 
mainder of this section I will take for granted the existence of the Amerind 
family. 

Before turning to the internal subgrouping of Amerind and its various 
components, it will be useful to review those groups whose genetic unity is 
beyond question. I believe the following families fall into this category: 


(15 ALGIC (= ALGONQUIAN-RITWAN) TANOAN 


ALGONQUIAN UTO-AZTECAN 
CHIMAKUAN OTO-MANGUEAN 
WAKASHAN CHIBCHAN-PAEZAN 
SALISH MACRO-GE 
CADDOAN MACRO-PANOAN 
SIOUAN MACRO-CARIB 
IROQUOIAN ANDEAN 

HOKAN EQUATORIAL 
PENUTIAN MACRO-TUCANOAN 


Within these families there are of course subfamilies whose genetic validity 
is equally secure (e.g. Yuman and Pomo within Hokan, Mayan and Miwok- 
Costanoan within Penutian); these will be mentioned below in the discus- 
sion of the subgrouping of the families listed in (15). 

Once the true dimensions of the Amerind phylum are recognized, the 
entire classification problem becomes one of subgrouping, as Greenberg 
(1960: 791) pointed out. All the languages are related, but which ones are 
more closely related to which others? What well-defined subgroups are 
there, and how do they relate to each other? Given the inherent difficulties 
in discovering the subgrouping of a family (particularly a large and ancient 
one), as compared with the relatively easier task of identifying the family 
itself, it should be clear that whatever subgrouping is proposed for Amer- 
ind must be considered merely a first approximation, subject to modifica- 
tion and revision as research proceeds. Greenberg (1987) suggests the in- 
ternal classification of Amerind shown in List 6.11. 

This classification calls for a number of observations. (1) Contrary to pre- 
vailing opinion (and Greenberg's expectations), Sapir's Algonquian-Mosan 
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List 6.11. The Amerind Phylum (Greenberg 1987) 


AMERIND: 
І NORTHERN AMERIND: 
A ALMOSAN-KERESIOUAN: 





















П CENTRAL AMERIND: 
A TANOAN 


1 ALMOSAN: B UTO-AZTECAN 
a Kutenai С OTO-MANGUEAN 
b MOSAN: HI CHIBCHAN-PAEZAN 
i CHIMAKUAN (including Timucua, Tarascan, 
ii WAKASHAN and Cuitlatec) 
iii SALISH IV ANDEAN (including 
c ALGIC: Mapudungu) 
i RITWAN V EQUATORIAL-TUCANOAN: 
ii ALGONQUIAN A MACRO-TUCANOAN 
2 KERESIOUAN: (including Nambikuara) 
a KERESAN B EQUATORIAL (including 
b SIOUAN-YUCHI Jivaroan and Chipaya) 
c CADDOAN VI GE-PANO-CARIB: 


d IROQUOIAN A MACRO-CARIB 


B PENUTIAN (including Zuni, B GE-PANO: 
Gulf, Yuki-Wappo, Huave, 1 MACRO-PANOAN 
Totonacan, Mixe-Zoque, and 2 MACRO-GE 
Mayan) 

C HOKAN (including 
Yurimangui) 


grouping is supported. (2) Keres, previously regarded as an isolate, is a 
member of Keresiouan within Almosan-Keresiouan, perhaps closest to the 
Iroquoian family. (3) Yuki-Wappo is a displaced member of the Gulf group. 
(4) Gulf is a member of Penutian. (5) Oto-Manguean, Tanoan, and Uto- 
Aztecan form a stock Greenberg calls Central Amerind. (6) Tarascan, Tim- 
ucua, and Cuitlatec, long considered isolates, are all assigned to Chibchan- 
Paezan. (7) Nambicuara, which Greenberg (1960) included in Ge-Pano- 
Carib, has been reassigned to Macro-Tucanoan. (8) The Ge-Pano-Carib stock 
posited in Greenberg (1960) has been retained, but the Andean-Equatorial 
stock posited there has been divided into two coordinate branches, Andean 
and Equatorial-Tucanoan. (9) In North America Penutian, Hokan, and 
Almosan-Keresiouan have been joined in a higher-level grouping called 
Northern Amerind. 

The internal classification of the numerous Amerind families poses a 
whole host of problems, only the most salient of which can be touched on 
here. The internal subgroupings adopted in this volume reflect the sources 
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listed іп the bibliography to this section, relevant portions of Voegelin апа 
Voegelin (1977), and the erpert advice of some 20 Amerindian scholars, 
whose assistance I have acknowledged at the beginning of this volume. 
None of these scholars, of course, is responsible for the use I have made of 
their advice. Let us examine the various Amerind families in the order in 
which they are listed in (15) above. 

Algic. The name Algic was suggested by Teeter (1965: 225) as а replace- 
ment for Algonquian-Ritwan. Of the three components of Algic, Algon- 
quian is a close-knit family covering much of the eastern United States and 
Canada, and Wiyot and Yurok are (or were—Wiyot is recently extinct and 
Yurok is moribund) single languages spoken side by side on the coast of 
northern California, some 600 miles (1,000 km) from the nearest Algon- 
quian language. Sapir's (1913) discovery of the relationship between Algon- 
quian, Wiyot, and Yurok, and the controversy it provoked, have been ex- 
amined above in some detail. Sapir's aim in this debate was to establish the 
validity of Algic (- Algonquian-Ritwan); he was noncommittal about the 
validity of Ritwan. Teeter (1965: 224) expresses doubt regarding Ritwan, 
whereas Haas (1966a: 103) considers it an open question. Goddard (1981) 
believes Ritwan is "probably a subgroup, though [this] has not yet been 
demonstrated." Recently Berman (1982) has suggested two possible inno- 
vations supporting a Ritwan group. To my knowledge, the possibility that 
either Wiyot or Yurok is closer to Algonquian than the other has never been 
raised. Given the dissimilarity between Wiyot and Yurok, however, this 
possibility cannot be ruled out a priori. 

Algonquian. Despite its enormous geographic spread, the Algonquian 
family is a close-knit group whose internal homogeneity is now generally 
accepted. Eastern Algonquian, comprising those languages spoken along 
the Atlantic seaboard, is the only clearly distinguished subgroup. Goddard 
(1979: 105) suggests that Blackfoot "is the most divergent language of the 
family." The Algonquian subgrouping followed in this volume (see List 6.12) 
is based on Goddard (1978a,b, 1979, 1981). 


List 6.12. The Algonquian Family (Goddard 1978a,b, 1979) 


ALGONQUIAN: [II ALGONQUIAN PROPER] 
I Blackfoot F Menominee 


II ALGONQUIAN PROPER: G Fox 
A Cheyenne H tIllinois 
B Arapaho I Shawnee 
C Cree J EASTERN: Micmac, 
D Ojibwa Passamaquoddy, 





E Potawatomi Eastern Abenaki, etc. 
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List 6.13. The Salish Family 
(Thompson 1979) 


SALISH: 
I Bella Coola 
JH SALISH PROPER: 
A COAST: 
1 CENTRAL 
2 TTillamook 
B TSAMOSAN: 
1 INLAND 
2 MARITIME 
C INTERIOR: 
1 NORTHERN 
2 SOUTHERN 


Chimakuan. Since the Chimakuan family consists of only two languages 
(one of which is extinct), the question of subgrouping does not arise. 

Wakashan. The small Wakashan family is sharply differentiated into 
Northern (= Kwakiutlan) and Southern (= Nootkan) branches. 

Salish. According to current opinion, a single language, Bella Coola, is 
coordinate with the remainder of the family, which itself is divided into 
three subgroups. Тһе Salish classification adopted here (List 6.13) follows 
Thompson (1979). 

Caddoan. The Caddoan family is structured like an onion, with Pawnee 
and its divergent dialect Arikara at the center, and with Kitsai, Wichita, and 
Caddo progressively farther away from the nucleus (List 6.14). 

Siouan. There is a deep cleavage between Catawba and the rest of the 
Siouan family, so deep in fact that some scholars do not regard Catawba as a 
Siouan language. Such reservations are, however, terminological, not sub- 
stantive, since it is generally accepted today that Catawba is more closely 


List 6.14. The Caddoan Family 
(Chafe 1979) 


CADDOAN: 
I SOUTHERN: Caddo 
H NORTHERN: 
А Wichita 
B PAWNEE-KITSAI: 
1 TKitsai 
2 PAWNEE-ARIKARA 
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List 6.15. Тһе Siouan Family (Rood 1979) 


SIOUAN: 
I Catawba 
II SIOUAN PROPER: 
A Mandan 
B NORTHERN PLAINS: Crow, Hidatsa 
C MISSISSIPPI VALLEY: 
1 Dakota 
2 DHEGIHA 
3 CHIWERE-WINNEBAGO 
D 1SOUTHEAST: 
1 TTutelo 
2 tBILOXI-OFO 


related to Siouan than to any other language or group. Siouan Proper is 
itself divided into three subgroups and the difficult-to-classify single lan- 
guage Mandan. Tutelo was apparently so divergent from Biloxi-Ofo that the 
validity of the Southeast group is in question. List 6.15 proposes a tentative 
subgrouping, subject to revision. 

Iroquoian. Cherokee is distantly but clearly related to the rest of the Iro- 
quoian family. The remaining languages, called Northern Iroquoian, are all 
closely related, with the languages of the Five Nations (plus the extinct Sus- 
quehannock) constituting one subgroup and Tuscarora-Nottoway another 
(List 6.16). 

Hokan. Unlike such close-knit families as Algonquian, Salish, and Іго- 
quoian, Hokan is an ancient grouping consisting of several subfamilies 


List 6.16. The Iroquoian Family (Mithun 1979) 


IROQUOIAN: 
I NORTHERN: 
А tLaurentian 
В tHuron 
C FIVE NATIONS: 
1 tSusquehannock 
2 MOHAWK-ONEIDA 
3 SENECA-ONONDAGA: 
a Onondaga 
b SENECA-CAYUGA 
D TUSCARORA-NOTTOWAY 
П SOUTHERN: Cherokee 
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(e.g. Pomo, Yuman), a number of isolated languages (e.g. Karok, Shasta, 
Washo, Seri), and a few outliers (e.g. Yurimangui, Jicague, Tlapanec, 
Coahuiltecan), whose affinity with the rest of Hokan continues to be a 
matter of dispute. In the context of recent attempts to break up the Hokan 
stock into many independent families, questions of subgrouping have ob- 
viously fallen by the wayside. Furthermore, no Amerindian group has suf- 
fered so grievously from binary comparisons as Hokan. Since a binary ap- 
proach to language classification is useless in determining subgrouping, it 
is not surprising that there has been no progress in unraveling the internal 
relationships of Hokan in the past half-century. The Hokan subgrouping 
adopted in this volume (List 6.17) is based on Sapir (1929), as modified by 
Rivet (1942), Greenberg апа Swadesh (1953), and Greenberg (1987); the in- 
ternal structure of Pomo follows Halpern (1964), and Yuman follows Lang- 
don (1981). 


List 6.17. The Hokan Stock (Sapir 1929 and others) 


HOKAN: IV SERI-YUMAN: 
I NORTHERN: A Seri 
A Karok B YUMAN 
B Shasta V TWAICURI-QUINIGUA 
C TChimariko VI COAHUILTECAN 
D PALAIHNIHAN ҮП TEOUISTLATECAN 
E TYANAN ҮШ SOUTHERN: 
F POMO A TLAPANECAN 
П Washo B Jicaque 





HI SALINAN-CHUMASH C TYurimangui 


Penutian. Penutian is, like Hokan, an ancient and controversial grouping 
that was first recognized in California and later found to extend southward 
into Mexico and, in the case of Penutian, northward into Canada. Yet the 
internal texture of Penutian, characterized by continuity, contrasts sharply 
with the discontinuous nature of Hokan. Whereas only two languages were 
added to Hokan after 1925 (Yurimangui, Jicaque), Penutian has shown a 
particular predilection for expansion, often in different directions in the 
work of different scholars. The Penutian classification adopted here (List 
6.18) is based on Sapir (1929), as modified by McQuown (1942) and Green- 
berg (1987). The Muskogean classification follows Haas (1979), and the 
Mayan subgrouping is based on Kaufman (1982). 

Greenberg's Penutian stock differs from earlier proposals in two impor- 
tant respects: (1) the Gulf group, first established in Haas (1951) and previ- 
ously linked to Algonquian (Haas 1958a) and to Hokan (Swadesh 1954), is 
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for the first time included in Penutian; and (2) the Yuki-Wappo family, con- 
sidered by Shipley (1976: 447) “the most intriguing and problematical in 
California,” is included in the Gulf group. Despite the fact that Yuki and 
the distantly related Wappo (pace Sawyer 1980) are (or were) spoken in 
northern California, over 2,000 miles (3,200 km) from the nearest Gulf lan- 
guage, the affinity of Yuki-Wappo with Gulf is, іп Greenberg's opinion, so 
obvious that had they been spoken in Louisiana they would have been in- 
cluded in the Gulf group without fanfare. 


List 6.18. Тһе Penutian Stock (Sapir 1929, Greenberg 1987, and others) 


PENUTIAN: VII GULF: 
I CANADIAN: Tsimshian A TUNICA-CHITIMACHA 
II WASHINGTON: Chinook B YUKI-WAPPO 


HI ОКЕСОМ C NATCHEZ-MUSKOGEAN: 
IV PLATEAU 1 TNatchez 
V CALIFORNIA: 2 MUSKOGEAN 

A WINTUN ҮШ MEKICAN: 

B MAIDUAN A Huave 

C YOKUTS B TOTONACAN 


D MIWOK-COSTANOAN 
VI NEW МЕХІСО: Zuni 


C MIKE-ZOOUE 
D MAYAN 





The inclusion of Yuki-Wappo in Penutian, and even in Gulf, had been 
foreshadowed in the literature, but since Yuki has been compared over the 
years with various groups and languages, the earlier proposals attracted 
little attention. In 1906 Kroeber compared Yuki and Yokuts (a Penutian lan- 
guage) and noted a structural similarity, though he did not claim they were 
related. Radin (1919) first connected Yuki with Penutian, and Shipley 
(1957) suggested 32 Yuki-Penutian cognates. Sapir (1929), on the other 
hand, included Yuki in his rather amorphous Hokan-Siouan phylum, and 
Swadesh (1954) placed Yuki in his “Hokogian” (= Hokan + Gulf) family. 
Gursky (1965а) also assigned Yuki to Hokan, and Elmendorf (1963, 1964) 
proposed links with Siouan and Yuchi. In spite of these diverse proposals 
(or perhaps because of them), both consensus classifications left Yuki- 
Wappo as an isolated family without known relatives. Greenberg's discov- 
ery that Yuki and Wappo are displaced Gulf languages confirms a suspicion 
expressed in Swadesh (1954). Swadesh (1954: 324), who grouped Yuki with 
Gulf and (Southern) Hokan (i.e. Coahuiltecan), quotes a letter from Robert 
Melton supporting his conclusions: 


Ethnological evidence [indicates] that the Yuki do not “belong” in California, and 
have had seemingly direct contact with the Pueblo and SW, most likely in passage 
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from near New Orleans. The chiefship pattern among the Yuki is very un-Califor- 
nian and very possibly closer to the SE than elsewhere. They were the most warlike 
in California, except for the extreme South. They sound re . . . Шке. . . Chitimacha 
[a Gulf language of Southern Louisiana]. 


Tanoan. Tanoan is a small, close-knit family. All the Tanoan languages ex- 
cept Kiowa are (or were) spoken in Pueblo societies in what is today New 
Mexico. Since Kiowa was originally spoken throughout the Great Plains, 
from Montana to Mexico, it was at first assumed that Kiowa would prove to 
be the most divergent member of the family. Such expectations, however, 
proved false, and it is now generally agreed that the geographic and cul- 
tural distance between Kiowa and the remainder of the family is not matched 
by a similar linguistic divergence, Davis (1979) suggests that Tiwa (some- 
times considered a single language, though there is a fairly sharp division 
between Northern and Southern subgroups) is closest to the extinct Piro. 
Furthermore, Tiwa(-Piro) is genetically closest to Tewa, and Kiowa and 
Towa may also constitute a subgroup of Tanoan, though the evidence for 
this is less clear than in the case of Tiwa-Tewa (see List 6.19). 

Uto-Aztecan. A general consensus has developed that Uto-Aztecan is 
composed of eight genetic units: six families (Numic, Takic, Pimic, Taraca- 
hitic, Corachol, Aztecan) and two isolated languages (Tübatulabal, Hopi). 
(The Giamina language, now extinct, may have been a ninth branch.) Diver- 
gence of opinion arises solely with regard to their subgrouping. List 6.20 
compares three proposals. The validity of Shoshonean and and the validity 
of Sonoran (with or without Aztecan) are of course independent questions; 
Lamb (1964: 122) reports that Kroeber accepted Shoshonean but rejected 
Sonoran. The Uto-Aztecan classification adopted in this volume (List 6.21) 
is based on Lamb (1964). 

According to Steele (1979), some scholars believe that Tübatulabal is most 
closely related to Numic, Hopi is closest to Takic, and Southern Numic and 
Central Numic constitute a group coordinate with Western Numic. 


List 6.19. The Tanoan Family 
(Davis 1979) 


TANOAN: 
I KIOWA-TOWA 
П TEWA-TIWA: 
A Tewa 
B TIWA-PIRO: 
1 TPiro 
2 TIWA: 
a NORTHERN 
b SOUTHERN 
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List 6.20. Proposed Subgroupings of the Uto-Aztecan Family 


Whorf 1935, Voegelin, Voegelin Heath 1975 (reported 
Lamb 1964 & Hale 1962 in Steele 1979) 
UTO-AZTECAN: UTO-AZTECAN: UTO-AZTECAN: 
I NUMIC I SHOSHONEAN: I NORTHERN: 
II Tübatulabal A NUMIC A NUMIC 
III TAKIC B Tübatulabal B Tübatulabal 
IV Hopi C TAKIC C TAKIC 
V PIMIC D Hopi D Hopi 
VI TARACAHITIC II SONORAN: II SOUTHERN: 
VII CORACHOL A PIMIC A PIMIC 
ҮШ AZTECAN B TARACAHITIC B TARACAHITIC 
C CORACHOL C CORACHOL 
III AZTECAN D AZTECAN 


List 6.21. The Uto-Aztecan Family (Lamb 1964 and others) 


UTO-AZTECAN: IV Hopi 
I NUMIC: V PIMIC: 
A WESTERN А PIMAN 
B CENTRAL B TEPEHUAN 
C SOUTHERN V] TARACAHITIC: 
II Tübatulabal A tTubar 
Ш TAKIC: В TARAHUMARA-GUARIJIO 
А TGabrielino C CAHITA 
B SERRAN VII CORACHOL 
C CUPAN: ҮШ AZTECAN: 
1 Luisefio А tPochutla 


2 CAHUILLA-CUPENO B AZTEC PROPER 





Oto-Manguean. A half-dozen small families (Otomian, Mixtecan, Popo- 
locan, Chinantecan, Zapotecan, Manguean) and the isolated language 
Amuzgo make up the Oto-Manguean family. The classification adopted 
here (List 6.22) is based on Rensch (1976), who has elaborated the basic 
structure postulated by Robert Longacre. Some linguists have proposed the 
inclusion of Huave and Tlapanecan in Oto-Manguean. Greenberg (1987), 
however, classifies them as (Mexican) Penutian and Hokan, respectively, 
and I have followed that position. Attempts at an internal subgrouping of 
the seven Oto-Manguean branches have not yet identified any higher-level 
groupings. In particular, the (extinct) Manguean branch, which was geo- 
graphically separated from the remainder of the family by almost 700 miles 
(1,100 km), does not show a comparable linguistic divergence. 
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List 6.22. The Oto-Manguean Family (Rensch 1976) 
OTO-MANGUEAN: IV POPOLOCAN 







І OTOMIAN V CHINANTECAN 
П МІХТЕСАМ VI ?АРОТЕСАМ 
Ш Amuzgo ҮШ TMANGUEAN 


Chibchan-Paezan. All seven South American stocks (i.e. Chibchan-Paezan, 
Andean, Macro-Tucanoan, Eguatorial, Macro-Carib,Macro-Panoan, Macro- 
Ge) derive from the work of Greenberg (1960, 1986). The subgrouping 
within these seven stocks follows in the main Greenberg (1986), though 
Voegelin and Voegelin (1977) and Grimes (1984) have also been used. 

Chibchan-Paezan, extending from Central America deep into South 
America, consists of two families, Chibchan and Paezan (see List 6.23). Sev- 
eral languages that had long been regarded as language isolates (Tarascan, 
Cuitlatec, Timucua) are included іп Chibchan-Paezan іп Greenberg (1987). 
Tarascan and Cuitlatec (both spoken in southern Mexico) have been added 
to the Chibchan branch, and Timucua is included іп Paezan. The addition 
of Timucua (formerly spoken in northern Florida, but long extinct) is sur- 
prising, in that Chibchan-Paezan languages were thought to be restricted 
to Central and South America. Nevertheless, Greenberg's conclusions are 
similar to those of Granberry (1970, as reported in Crawford 1979), who 


List 6.23. The Chibchan-Paezan Stock (Greenberg 1987) 





















CHIBCHAN-PAEZAN: П PAEZAN: 


І CHIBCHAN: А tTimucua 

А Tarascan В Warao 

В TCuitlatec С TKunza 

С Хіпса D tBetoi 

D Lenca E tChimu 

E Paya F Itonama 

F YANOMAM G THUARPE 

G NUCLEAR CHIBCHAN: H #JIRAJARAN 
1 Motilon I MURA 
2 Cuna J NUCLEAR PAEZAN: 
3 MISUMALPAN 1 tAndaqui 
4 RAMA 2 CHOCO 
5 TALAMANCA 3 INTER-ANDINE 
6 GUAYMI 4 BARBACOAN 
7 tANTIOQUIA 
8 ARUAK 


9 CHIBCHAN PROPER 


6.3 Amerind 239 


List 6.24. The Andean Stock (Greenberg 1987) 


ANDEAN: IV QUECHUAN 
I TNORTHERN: V AYMARAN 
А tCulli VI SOUTHERN: 
В tSec A tPuelche 
C tLeco В MAPUDUNGU 
D tCATACAO (= ARAUCANIAN) 
E tCHOLONA C QAWASQAR-YAMANA 


П URARINA-WAORANI 
HI CAHUAPANAN-ZAPAROAN 


D PATAGONIAN 





concluded that Timucua was genetically closest to Warao (a Paezan lan- 
guage of Venezuela). 

Andean. The Andean family extends down the western coast of South 
America from present-day Ecuador to Tierra del Fuego. Although Andean 
contains fewer languages (about 20) than any other South American branch 
of Amerind, its speakers account for approximately half of the entire Amer- 
ind population. The extremely large number of Quechua speakers and sig- 
nificant numbers of Aymara and Mapudungu (= Araucanian) speakers 
(see Table 6.1) accounts for about nine million people. Greenberg (1987) 
subdivides the Andean family into a half-dozen branches, as shown in List 
6.24. On the subgrouping within Southern Andean, I have followed the ad- 
vice of Christos Clairis (personal communication). 

Macro-Tucanoan. About 50 languages make up the Macro-Tucanoan stock. 
Most of the languages are found in what is today western Brazil, but there 
are outliers in both southern and eastern Brazil. According to Greenberg 
(1987), the family is composed of seven subfamilies and a dozen isolated 
languages (see List 6.25). The Tucanoan subgrouping adopted in this vol- 
ume follows Waltz and Wheeler (1972). 


List 6.25. The Macro-Tucanoan Stock (Greenberg 1987) 





MACRO-TUCANOAN: X tPankararu 
I tAuishiri XI Shukuru 
П Canichana XII Uman 
III Capishana XIII CATUQUINAN 
IV TGamella XIV HUARI 
V Iranshe XV ARUTANI-SAPE 
VI Koaia XVI NAMBIKUARAN 
VII Movima XVII PUINAVE-MAKU 
ҮШ Muniche XVIII TICUNA-YURI 
IX Natu XIX TUCANOAN 
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List 6.26. The Eguatorial Stock (Greenberg 1987) 


EOUATORIAL: XII MACRO-ARAWAKAN: 
I Cayuvava A Katembri 
II Camsa B tOtomaco 
HI tTaruma С TINIGUAN 
IV Trumai D GUAHIBAN 
V TTusha E ARAWAKAN: 
VI Yuracare 1 TGuamo 
ҮШ PIAROA-SALIBA 2 TTaino 
ҮШ TTIMOTE 3 Chamicuro 
IX JIVAROAN 4 ARAWAN 
X ZAMUCOAN 5 CHIPAYAN 
ХІ KARIRI-TUPI 6 CHAPACURAN 
7 MAIPURAN 





Equatorial. The Equatorial family contains more languages (about 150) 
and is more widespread than any of the other six branches of Amerind 
found in South America. Its geographic distribution is enormous, extend- 
ing from the islands of the Caribbean as far south as Uruguay; on an east- 
west axis it literally spans the entire continent. Two subfamilies of Equa- 
torial, Arawak (about 80 languages) and Tupi (about 50), account for most 
of the languages and much of its broad distribution. Greenberg (1987) sub- 
divides the family into a half-dozen isolated languages and a half-dozen 
subfamilies, as illustrated in List 6.26. 

Macro-Carib. Greenberg (1987) joins two single languages and two small 
families with the larger and more widespread Carib family to form the 
Macro-Carib stock (see List 6.27). For the most part these 50 languages are 
spoken in the northern regions of South America, from Colombia to French 
Guiana, though there is one group of Carib languages located far to the 
south in central Brazil. The Carib classification followed in this volume 
comes from Durbin (1977). 


List 6.27. The Macro-Carib Stock 
(Greenberg 1987) 


MACRO-CARIB: 
I Andoke 
II +Сисига 
Ш PEBA-YAGUAN 
IV BORA-WITOTO 
V CARJB 
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List 6.28. Тһе Macro-Panoan Stock 
(Greenberg 1987) 


MACRO-PANOAN: 
I TCHARRUAN 
II LULE-VILELA 
III MASCOIAN 
IV Chimane (-- Moseten) 
V MATACO-GUAICURU: 
A GUAICURUAN 
B MATACO 
VI PANO-TACANA: 
A PANOAN 
B TACANAN 


Macro-Panoan. About 50 languages are all that survive today of a family 
that once extended from Peru to Uruguay. Macro-Panoan unites a few small 
families (some extinct), the language isolete Chimane (= Moseten), and the 
more numerous and widespread Panoan and Tacanan subfamilies. Panoan 
and Tacanan form one higher-level grouping, and Mataco and Guaicuruan 
another, as seen in List 6.28. 

Macro-Ge. Languages belonging to the Macro-Ge stock were originally 
spoken over a broad expanse of eastern South America. Today there are 
only about two dozen surviving languages and the internal structure of the 
family is far from clear. About all that can be said with assurance is that the 
Ge and Kaingang families are most closely related to each other, as shown 
in List 6.29. 

The following bibliography, like all the others in this volume, is arranged 
topically. Since this bibliography is the longest in the book, and is sub- 
divided into almost two dozen subsections, the reader who wishes to look 
up a particular citation in the text must be careful to look under the appro- 


List 6.29. The Macro-Ge Stock (Greenberg 1987) 





MACRO-GE: 
I Rikbaktsa IX tCHIQUITO 
II Fulnio X #КАМАКАМ 
III Guato XI TPURI 
IV Opaye XII YABUTI 
V tOti XIII MASHAKALI 
VI Caraja XIV GE-KAINGANG: 
VII BORORO А KAINGANG 
VIII BOTOCUDO B GE 
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priate subsection. Books and articles are classified in this bibliography as 
narrowly as possible. This means that а book dealing exclusively with Al- 
gonquian languages will be listed under Algonquian, even if it has been 
referred to in a more general context in the text itself. This topical arrange- 
ment also means that some works published by a scholar in the same year 
are, according to convention, distinguished Бу а and b following the date, 
even though the titles may be in different subsections of the bibliography. 
The first subsection (Amerind) contains works of a general nature as well as 
a sampling of books and articles that have called attention to similarities 
between supposedly independent Amerind groups. In the latter case the 
groups (or languages) compared are given in brackets following the source. 
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Prospects for Future Research 


After a period of relative quiescence, genetic classification has, since 1950, 
undergone a renaissance, with discoveries as remarkable as any made by 
the nineteenth-century pioneers. More scholars than ever before are cur- 
rently interested in taxonomy, despite the pejorative connotation that the 
word taxonomic has acquired in recent years in certain linguistic circles. 
Moreover, there is every indication that this renewed interest will continue 
for the remainder of this century. In this chapter, after first summarizing 
various methodological errors that should be avoided in future research, I 
will indicate the directions that work should take, with respect both to low- 
level refinement and large-scale groupings. 


7.1 METHODOLOGICAL ERRORS TO AVOID 


Some of the errors discussed in this section have fortunately become rare 
in recent years; others continue to flourish. I will discuss here only the most 
fundamental and most common ones. 

Among the most basic errors, especially in the early literature, was the 
belief that the principles of genetic linguistics, as developed for Indo-Euro- 
pean languages, were not readily applicable to languages in other parts of 
the world that lacked early written records or were spoken by people with a 
low level of technology and/or non-Caucasian racial traits. It is now well 
established that the principles of genetic classification are valid for the en- 
tire human population. That genetic linguistics happened to be discovered 
largely in terms of the Indo-European languages merely by happenstance is 
still not fully appreciated. 
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A. second major error has been the adoption of too narrow a purview. 
This may manifest itself in several ways. At its worst it involves the binary 
comparison of two languages (or groups), usually arbitrarily selected, in an 
attempt to prove that they are related. This approach confuses genetic rela- 
tionship with classification, the latter being a far more powerful concept en- 
compassing the former. In order to investigate genetic classification fruit- 
fully, one must make every effort to work on a valid genetic node, though 
the node itself may represent a group of languages that has only recently 
diverged from a common origin (e.g. Slavic, Bantu, Polynesian, Mayan) or 
one of considerable antiquity (e.g. Nilo-Saharan, Australian, Amerind). It 
is incumbent on the investigator to study the entire group as fully as possible 
and to ignore arbitrary geographic and political boundaries. One can no 
more study Amerind by comparing, say, Penutian and Algonquian than 
one can study Indo-European in terms of Slavic and Celtic alone. In classi- 
ficatory work it is far better to err by comparing too many languages than 
too few, so where there is legitimate doubt about the boundary of some ge- 
netic group, an investigator would be well advised to include in his study 
any possible member of the group. 

A related error involves the comparison of one family with only part of 
another family. Thus, for example, Indo-European is still sometimes com- 
pared directly with Semitic. There is no longer any doubt, however, that 
Semitic is but one branch of Afro-Asiatic, so any meaningful comparison 
must be between Indo-European and all of Afro-Asiatic. Should it turn out 
that resemblances are restricted to Indo-European and Semitic, this would 
constitute a strong argument that the similarities are due to borrowing and 
not to common origin. 

One of the more serious errors in contemporary classification involves 
the artificial standards of proof of genetic affinity that are still followed by 
many linguists. In the attempt to recognize valid cognates, criteria such as 
the presence of regular correspondences and the reconstruction of proto- 
forms are still commonly invoked. On the semantic side, some scholars 
even go so far as to require an identity of meaning among cognates. Ob- 
viously, research limited by such artificial constraints as these will be un- 
able to probe very deeply into the past. 

The early error of confusing genetic classification, which is based on 
common origin, with classifications based either on typological (e.g. tone, 
word order) or nonlinguistic criteria (e.g. race, level of technology) is en- 
countered less frequently today than in the past. Similarly, the problem of 
distinguishing borrowed words from the inherited stock is now handled 
with greater sophistication than in an earlier day. Borrowing has befuddled 
genetic classification in two ways. First, the failure to distinguish borrowed 
roots from inherited ones undermines the basis of genetic classification. 
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Equally pernicious, however, has been the belief that borrowing may oper- 
ate almost without constraints; from this perspective, shared by Franz 
Boas, Malcolm Guthrie, and others, almost any potential cognate can be 
attributed to borrowing. Recent studies in bilingualism (see Grosjean 1982) 
have confirmed what astute scholars have always known: borrowing takes 
place under very special circumstances, usually involving bilingualism, 
and affects a fairly limited area of vocabulary (chiefly nouns). The idea that 
any root can “radiate out," contaminating languages at great distances, con- 
fuses the relative ease of cultural diffusion of artifacts with the much more 
restrictive nature of linguistic diffusion. 

One still encounters today, though with decreasing frequency, argu- 
ments to the effect that family A and family B cannot be related because 
differences in some area of linguistic structure (e.g. morphology, color ter- 
minology, numerals) are simply too profound. This is what I have referred 
to as the fallacy of negative evidence; scholars who advance such argu- 
ments fail to appreciate that in detecting common origin it is similarities 
that count, not differences. 

With regard to subgrouping, both methodology and characteristic errors 
are different from those associated with proving genetic affinity. One com- 
mon error results from an unsophisticated handling of putative innova- 
tions, either by giving all innovations equal weight or by failing to recognize 
that some apparent innovations may in reality be retentions from the proto- 
language. If a root is found in just one of, say, eight branches of a family, 
many scholars assume the root to be an innovation of that branch and fail to 
realize that it may instead be a root of the proto-language that has survived 
only in that branch. Second, one frequently encounters a rivalry between 
different subgrouping criteria, usually in the form of a "shared innova- 
tions" vs. "lexicostatistics" debate. Such arguments are essentially futile, 
since all methods of discovering the underlying historical reality should be 
used, where possible. When different methods (based on different assump- 
tions) all point in the same direction, we may have greater confidence that 
the indicated subgrouping is correct. And when they differ, we should rec- 
ognize that these methods are imperfect, and that some work better than 
others in certain contexts. This failure to distinguish clearly methods and 
goals has led at times to proposed subgroupings that are inherently im- 
plausible. (A similar adherence to supposed methodological principles has 
also led to the postulation of totally improbable sound systems for recon- 
structed languages.) 

Another common subgrouping error derives from the widespread belief 
that an error of commission is far more serious than an error of omission. 
In reality, both kinds of errors are equally serious, and the scholar who fails 
to work out the subgrouping in as much detail as possible cannot be said to 
have completely fulfilled the task of classification. 
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А final subgrouping error results from failing to recognize the hierarchical 
nature of a subgrouping. For a long time it was believed that the Bantu fam- 
ily was an independent family related to West Sudanic (= Niger-Congo). In 
fact, the autonomous status of the Bantu family is a historical artifact that 
does not correspond to linguistic reality. Bantu is rather merely one sub- 
family of the vast Niger-Congo stock. Ап analogous development took 
place with respect to Melanesian (= Oceanic) and Polynesian languages. 
The lesson to be learned from these examples is that subgrouping is more 
than simply stringing together related groups. 


7.2 LOW-LEVEL REFINEMENT 


Although the discovery of distant genetic relationships is intrinsically 
more exciting than working out the internal structure of low-level genetic 
nodes, the latter type of work has the advantage of being more readily ap- 
proachable without an enormous period of preparation. Furthermore, 
there are innumerable areas where someone with the requisite background 
could make a substantial contribution to linguistic taxonomy in a reason- 
able period of time, depending of course on the size and difficulty of the 
problem tackled. Therefore, someone who wishes to engage in taxonomic 
research would be well advised to begin with a clearly delineated group 
whose internal subgrouping has never been worked out. Such modest 
projects are likely to be far more beneficial for linguistic taxonomy (and 
more instructive for the investigator) than the pursuit of some fanciful 
hunch that, say, Haida is really an Altaic language. 

In approaching a problem of subgrouping, the principal criterion for de- 
termining valid subgroups is the establishment of shared innovations for 
each subgroup postulated. Not all innovations are of equal importance, 
since many phonological and semantic shifts may reflect the independent 
operation of universal tendencies rather than a shared innovation. For ex- 
ample, the voicing of voiceless stops between vowels or before nasal conso- 
nants is such a common shift that it may easily take place independently in 
different languages. An associated problem is that closely related languages 
often "drift" in similar directions, beyond what could be expected on the 
basis of universal tendencies alone. I do not wish to imply that phono- 
logical and semantic innovations are without value. On the contrary, they 
can at times, and particularly in sufficient number, be probative, but they 
must be handled with care. 

Because they combine the arbitrariness of sound and meaning, lexical in- 
novations are particularly important. Two languages are hardly likely to in- 
vent the same word (say, the sounds plok with the meaning ‘sky’) indepen- 
dently of one another. Therefore, lexical innovations are fairly impervious 
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to phenomena such as convergence and drift, and relatively greater weight 
should be given them for this reason. 

One final aspect of subgrouping merits attention. Consider the following 
comments from Welmers (1973: 15—16): 


Language division usually appears to take the form of bifurcation; it must normally 
be expected that the model for the origin of four branches would be one of the 
following: 


A three-way division from a single node is historically possible, of course; but 
where there is adequate evidence for reconstructing language history, few such 
cases have been demonstrated. References to “coordinate branches” should not be 
taken as actual hypotheses of multiple divisions from single nodes; they are more 
likely to be, by implication, mere admissions that the order of bifurcations is not 
clear from the evidence to date. 


I think Welmers is generally correct on this point. Many linguists confuse 
the indeterminacy of the data with historical reality. There is, however, an- 
other factor that leads away from strict bifurcations, namely, the hetero- 
geneity of any living language, particularly one that has many speakers or 
is spread over a large area. Suppose that languages A, B, and C have devel- 
oped from dialects a, b, and c of an original language L. Then a simple tri- 
chotomy more accurately describes the historical developments than a 
bifurcation. In most instances, however, it is unclear whether multiple 
branchings are due to the indeterminacy of the data, to dialect divisions 
already present in the original language, or to a combination of the two. 
The many examples of multiple branching in the classification presented in 
section 8.6 (e.g. Nilo-Saharan, 9; Indo-European, 9; Central and Western 
Trans- New Guinea, 12; Australian, 15; Oceanic, 17) are to be interpreted in 
this light. In some cases (e.g. Oceanic) we may be fairly certain that the 
seeming multiplicity of primary branches is due to a fairly rapid expansion 
over a large area rather than to dialect differences in Proto-Oceanic. In other 
cases (perhaps Indo-European) the different branches may well derive from 
different dialects of the proto-language, at least in part. 

In closing this section, I would like to return to a topic that has appeared 
several times earlier in this volume, namely, the dispute between those who 
would represent linguistic relationships by a tree structure and those who 
prefer a wave model. Branching-tree diagrams are appropriate for repre- 
senting sharp cleavages between languages (or groups of languages) where 
bundles of isoglosses clearly distinguish one entity from another. The wave 
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model, on the other hand, seems better suited to describing cases where 
sharp divisions are absent and overlapping isoglosses make the establish- 
ment of a family tree impossible. Whereas the family tree model empha- 
sizes the discrete nature of linguistic groups, the wave model underscores 
their clinal properties. These clinal properties are most apparent at the lower 
levels of classification, such as among the dialects of a highly diversified 
language (e.g. Ijo, German, Kurdish, Bikol, Zapotec) or among closely re- 
lated languages spoken over a continuous geographic area (e.g. Bantu, In- 
dic, Chinese, Athabaskan). Tree structure, however, is most obvious at 
higher levels of classification (e.g. between the various subfamilies that 
make up Afro-Asiatic, Indo-European, or Amerind). 

Although the hierarchical (tree) and clinal (wave) properties of genetic 
classifications are sometimes portrayed as competing (or even conflicting) 
tendencies, they are in fact complementary properties that are simultane- 
ously present at all levels. As early as 1872 Johannes Schmidt recognized 
that the seemingly distinct branches of Indo-European (e.g. Celtic, Italic, 
Germanic, Slavic) exhibit overlapping isoglosses, though they are few іп 
number and hence overshadowed by the innumerable differences between 
these families. Furthermore, dialects of a single language frequently are 
separated by bundles of isoglosses Just as languages are. Viewed in this 
light, the long-simmering feud between the “wave school” and the “tree 
school” can be seen to be misguided, for we are not dealing with an either/ 
or situation. Both hierarchical and clinal properties are consequences of lin- 
guistic evolution that may be present at any level of a genetic classification. 
The classification offered in the following chapter should be interpreted in 
this way. At the highest levels the branching indicates primarily sharp divi- 
sions between distinct groups, though the possibility of overlapping iso- 
glosses is never excluded. At the lowest levels the branching represents 
relative distance between languages (or groups of languages), but there is 
often no sharp boundary separating them into distinct groups. 


7.3 LARGE-SCALE GROUPINGS 


In Chapters 2-6, I reviewed the history of genetic classification and sug- 
gested that the taxonomic literature indicates the world's currently extant 
languages, except for a small number of language isolates (e.g. Basque, 
Burushaski, Nehali, Ket), belong to one of the 17 families shown in List 7.1. 

As I have attempted to make explicit in the discussion of each of these 
families, they vary in what might be called genetic transparency, from the 
small, close-knit Eskimo-Aleut and Chukchi-Kamchatkan families to the 
large and very ancient Australian and Indo-Pacific. The other groups fall 
somewhere between these two extremes. 
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List 7.1. The World's Language Families (Ruhlen 1987) 


KHOISAN URALIC-YUKAGHIR SINO-TIBETAN 
NIGER-KORDOFANIAN АТТАС AUSTRIC 
NILO-SAHARAN CHUKCHI- INDO-PACIFIC 
AFRO-ASIATIC KAMCHATKAN AUSTRALIAN 
CAUCASIAN ESKIMO-ALEUT NA-DENE 





INDO-HITTITE ELAMO-DRAVIDIAN AMERIND 


Although the reduction of the worlds roughly 5,000 languages to a mere 
17 families would be considered a radical step by a good number of contem- 
porary linguists, there are indications that even further consolidation is not 
only possible, but necessary. The bibliography to this section includes 
(under the rubric Genetic Links Between Families) a list of articles and 
books that discuss, either favorably or unfavorably, the possibility of genetic 
links among these 17 families; this listing is intended to be representative of 
such proposals and is in no way exhaustive. In seeking a more comprehen- 
sive classification, one must avoid the trap of binary comparison, which at 
this level of research seems so tempting. Rather, one should adopt a global 
approach, taking into account all of the world's languages, as Dyen (1959: 
546) suggested: “Greenberg's method of determining “valid relationships” 
by “mass comparison” without limitation as to number seems to imply that 
the optimum comparison is that of all the languages in the world at once. In 
fact, I think we must agree with this proposition; for, in effect, the discov- 
ery of genetic relationships is a way of partitioning the set of all languages 
in the world." 

The problem with studies attempting to "prove," for example, the Ural- 
Altaic hypothesis is that they invariably limit themselves to citing evidence 
that the two families are genetically related and never face up to the even 
more important question of whether the two families (Uralic and Altaic in 
this instance) are more closely related to each other than to any other lan- 
guage or group—that is, they constitute a valid genetic node. It is the fail- 
ure to address just this issue that makes all of the various Altaic proposals 
so inconclusive. Japanese may well be related to Altaic, but is it more closely 
related to Altaic than to Korean, Ainu, Gilyak, Chukchi-Kamchatkan, and 
other languages? This is the question that Altaicists should, but almost 
never do, tackle. They content themselves with a piece of the puzzle. 

As one can see in the bibliography to this section, certain families (e.g. 
Ural-Altaic, Indo-European-Uralic) have been alleged to be related many 
times, by different scholars, whereas other families (e.g. Khoisan and 
Amerind) have not, to my knowledge, ever been linked directly. The mere 
fact that certain proposals are more common than others does not neces- 
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sarily ensure that they are more valid. The numerous proposals linking 
Indo-European with various other families are as much a reflection of the 
Eurocentric bias of the linguistic literature as they are of the relationships 
themselves. Nevertheless, though there are disagreements on details (rather 
substantial details, it must be admitted), the literature cited in the bibli- 
ography to this section indicates fairly clearly that there are genetic rela- 
tionships between European and Asian families. Precisely which families 
are to be included in this vast phylum is still in dispute. This complex of 
European and Asian families has been referred to as Nostratic, Eurasiatic, 
and Mitian, the last term being based on the characteristic first- and 
second-person pronouns of the stock (mi ‘I,’ ti *you'). Recognition that cer- 
tain Eurasiatic families were related came as early as 1576, when Martin 
Frobischer proposed a relationship between Uralic and Eskimo. In 1869 Vil- 
helm Thomsen linked Indo-European with Uralic, and we saw in section 
4.1 that Uralic has been widely connected with Altaic since the nineteenth 
century. In the twentieth century a number of scholars have argued for Eu- 
rasiatic connections. List 7.2 illustrates several such proposals. Where pre- 
cisely the bounds of Eurasiatic lie is a matter to be resolved by future 
research. 


List 7.2. Proposed Subgroupings of the Eurasiatic (= Nostratic) Phylum 


Pedersen 1931 


Collinder 1965 


EURASIATIC: EURASIATIC: 
I SEMITIC I INDO-EUROPEAN 
II INDO-EUROPEAN II URALIC-YUKAGHIR 

Ш URALIC III ALTAIC-KOREAN 

IV Yukaghir IV СНОКСНІ-КАМСНАТКАМ 
V ALTAIC V ESKIMO-ALEUT 


VI ESKIMO-ALEUT 


Illich-Svitych 1971-84, 
Menges 1977, Birnbaum 1978 


Greenberg 1987 


EURASIATIC: 
I INDO-HITTITE 


EURASIATIC: II URALIC-YUKAGHIR 

I WESTERN: III ALTAIC 
A AFRO-ASIATIC IV KOREAN-JAPANESE: 
B SOUTH А Korean 

CAUCASIAN B Ainu 

C INDO-EUROPEAN C JAPANESE-RYUKYUAN 

II EASTERN: V CHUKCHI-ESKIMO: 
A URALIC A. Gilyak 
B DRAVIDIAN B CHUKCHI-KAMCHATKAN 
C ALTAIC C ESKIMO-ALEUT 
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List 7.3. The World's Language Families (Greenberg 1987) 


KHOISAN SOUTH CAUCASIAN AUSTRO-TAI 
NIGER-KORDOFANIAN EURASIATIC INDO-PACIFIC 
NILO-SAHARAN DRAVIDIAN AUSTRALIAN 
AFRO-ASIATIC SINO-TIBETAN NA-DENE 
NORTH CAUCASIAN AUSTROASIATIC AMERIND 





Greenberg (1987) classifies the world's languages into the 15 families 
shown in List 7.3. Proposals for joining North Caucasian and South Cauca- 
sian in a single Caucasian family were discussed in section 2.5 and those for 
uniting Austroasiatic and Austro-Tai in an Austric phylum, in section 4.5. 
Among other proposals for further consolidation, the most common are the 
following. (1) Niger-Kordofanian has been linked with Nilo-Saharan by 
Gregersen (1972), Boyd (1978), and Bender (1981), supporting a position 
advocated by L. Homburger (1941). (2) As List 7.2 indicates, both Afro- 
Asiatic (often just the Semitic branch) and South Caucasian (7 Kartvelian) 
have been linked with Indo-European. (3) The external relationships of the 
Dravidian family have long been a subject of controversy, with proposals as 
varied as Uralic, Australian, and Nilo-Saharan. The Nilo-Saharan connec- 
tion appears the most promising. (4) Sino-Tibetan and various members of 
Austric have been linked by scholars since the nineteenth century, chiefly 
on the basis of typological similarities and undiagnosed borrowings. (5) 
Wurm (1982) suggests that the Indo-Pacific and Australian phyla may some 
day be shown to be related, a possibility that Greenberg (1971) entertained 
but left in abeyance. (6) Na-Dene has been compared with Chinese by Sapir 
(1925), Shafer (1952, 1957), and Swadesh (1952), and with various Amerind 
groups. 

The ultimate question is, of course, whether all human languages are ge- 
netically related, in which case the problem of classification would be re- 
duced to one of subgrouping. This question is sometimes confused with 
that of monogenesis, though as Lamb (1964) points out, the questions are 
distinct. It is possible that all extant human languages are related, but that 
the theory of the monogenesis of language is not correct. This would be the 
case if human language had originated independently several times, and all 
but one strain had eventually become extinct. It is also possible that not all 
human languages are related. These are questions that, for some decades 
now, scholars have shied away from for essentially two reasons. First, the 
scholarly climate has not been receptive to such distant proposals, and the 
increasing specialization in academia has led in the other direction. Sec- 
ond, scholars supporting either monogenesis or the relatability of all lan- 
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Table 7.1. A Possible Global Etymology (Greenberg 1976) 





Phylum Form(s) Meaning Source? 
Nilo-Saharan tok~tek~dik ‘one’ Greenberg (1963) 
(South) Caucasian tit-i “finger” Gamkrelidze (1984) 

tito “single” Gamkrelidze (1984) 
Indo-European *d(e)ik ‘to point’ 
Uralic ik~odik~ytik ‘one’ Collinder (1977) 
Ainu tek ‘hand’ 
Japanese te ‘hand’ 
Eskimo-Aleut tik(-eq) (Esk.) “index finger' 

tik(-laq) (Al) ‘middle finger’ 
Sino-Tibetan *tik ‘one’ Benedict (1976) 
Yeniseian *tok ‘finger’ Bengtson (1986) 
Miao-Yao *nto? ‘finger’ Benedict (1975) 
Austroasiatic *ti? ‘hand, arm’ Benedict (1975) 
Austro-Tai “dian “finger, point Benedict (1975) 
Indo-Pacific tong~tang~teng ‘finger, hand, arm’ Greenberg (1971) 
Na-Dene t'ek~tikhi~4aq, (ka-)tleek~ | “опе” 

ti?eq— (ka-)tliki 'finger' 
Amerind tik ‘finger’ Greenberg (1987) 


“Where no source is cited I have found the forms myself in various standard dictionaries. 


guages run the risk of being branded Creationists and of therefore having 
their work disregarded by colleagues. Although these questions have been 
considered by many to be beyond the pale of scientific investigation, there 
are indications that this stone wall of opposition is beginning to crumble, 
and future consolidation of some of the groups in List 7.3 is likely. 

Most scholars have in recent years assumed, without bothering to exam- 
ine the evidence, that the stocks in List 7.3 must be so ancient that any trace 
of relationship would have long since disappeared, even if some (or all) of 
the groups did in fact derive from a common source. Greenberg, however, 
reported in a public lecture in 1976 that he had noticed a number of roots 
cemmon to several of these groups. One example he cited was the root 
tik with the meaning ‘finger’ or ‘one.’ Table 7.1 illustrates some possible 
members of this cognate set. Bengtson (1986) proposes 45 such global 


` etymologies. 


7.4 THE ORIGIN AND EVOLUTION OF LANGUAGE 


Few questions in the study of human language have attracted as much 
attention, provoked as much controversy, or resisted so resolutely their an- 
swers as that of the origin of language. As we saw in sections 2.1 and 2.2, 
this question was much debated during the eighteenth century, when He- 
brew was generally considered mankind's original tongue. The develop- 
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ment of comparative linguistics in the nineteenth century (see section 2.3) 
shifted attention from the origin of language to the origin of the European 
languages, a more tractable problem. Speculation on the origin of language 
continued, of course, throughout the nineteenth century, but the topic fell 
out of fashion with scholars, and the Linguistic Society of Paris, weary of 
the idle speculation, banned its discussion in 1866. In general the question 
of the origin of language remained out of favor through the 1960's as schol- 
ars shifted their attention from language change to language structure, that 
is, from a basically historical perspective to a static one. 

The past decade, however, has witnessed a resurgence in interest in this 
once-banned topic (see Hewes 1973, Lieberman 1975, 1984, Sagan 1977, 
Lyons 1977, Bickerton 1981, Leakey 1981, Itzkoff 1983). In 1975 the New 
York Academy of Sciences held a conference on the Origins and Evolution 
of Language and Speech (see Harnad, Steklis, & Lancaster 1976), at which 
some 70 papers were delivered on subjects as diverse as the linguistic com- 
petence of apes, language and the neurobiology of the human brain, the 
relationship between language and other cognitive systems, paleoneuro- 
logical evidence for language origins, child language acquisition and lan- 
guage origins, and sign language and the gestural theory of language 
origins." 

Yet despite the remarkable advances in our understanding of biology, ge- 
netics, paleontology, linguistics, the human brain, and semiotic systems 
that have been made during the past century, there was a total lack of con- 
sensus among the 1975 conference's participants on almost every question 
related to the origin of language. This suggests that we are not perceptibly 
closer to a solution to the enigma than were scholars of the nineteenth cen- 
tury. On the basic questions of when, how, and why human language de- 
veloped, there was much interesting speculation, with a number of “plau- 
sible scenarios" suggested, but there was virtually no convincing evidence 
produced to show that one particular scenario was the historically correct 
one. So that we may better understand the difficulty of the problem, let us 
briefly review some of the lines of research found in recent studies on the 
origin of language. 


7.4.1 Human Relatives 


As soon as it was realized that man is part of the animal kingdom, and in 
particular phylogenetically closest to the Old World apes, attempts were 
made to study the communicative systems and the physiology of apes and 


*One topic that was curiously overlooked at the conference was any attempt to assess what 
had become known in the last century about both the diversity and the uniformity of the 
world’s languages. Although ape language was discussed at length and in great detail, the 
world’s extant human languages were accorded very little attention. 
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monkeys to determine, first, why it was that monkeys and apes do not talk, 
and, second, what kind of rudimentary communication system they might 
have from which human language could have gradually evolved. Though 
the early studies had revealed little more than a high degree of similarity in 
the vocal apparatus of man and ape, the topic was resurrected in the 1970's, 
when several psychologists attempted to teach apes human language. One 
attempt to teach normal spoken English to a chimpanzee was generally un- 
successful, but other attempts, using sign language and arbitrary tokens 
manipulated by computer, revealed that chimps could learn much more of 
language than had hitherto been believed. In this regard it is useful to make 
a distinction between language and speech. Language is a symbolic com- 
municative system that represents meaning in terms of some arbitrary 
code. In all human languages (leaving aside sign language) the medium 
used to encode meaning is sound, and for this reason language and speech 
are often used synonymously. Chimps, for whatever reason, are not very 
adept at using the sounds of human language, but they are much more 
gifted in using sign language. Although no one claims that chimps can ap- 
proach human competence in the use of sign language, they can apparently 
learn a good deal of it and even use it creatively, an attribute that many be- 
lieve is restricted to man. (Interestingly, chimps also seem to have a concept 
of self, shown by their ability to recognize themselves in mirrors. Monkeys 
and lower primates lack this ability.) 

There is still great controversy over the meaning of these recent studies. 
Some linguists, such as Noam Chomsky and his followers, generally main- 
tain that human language is unrelated to the chimp's cognitive capabilities. 
Ronald Myers (1976: 755) expresses this viewpoint when he writes that "the 
speech of man has evolved not from the vocal responses of lower primates, 
but rather speech has developed de novo in man during his evolutionary de- 
velopment beyond the level of monkeys or, indeed, apparently, of the 
apes." Many others, however, believe it would be premature to dismiss this 
possibility, given our present rudimentary knowledge of the questions in- 
volved. According to Pribram (1976: 798), "The facts supporting biological 
evolution make it unlikely that human language has sprung de novo with no 
relationship to subhuman primate forms of communication." 


7.4.2 Human Ancestors 


If human relatives were poorly known in the nineteenth century, human 
ancestors were scarcely known at all. The discovery of man's closest an- 
cestors in the fossil record has been largely a twentieth-century develop- 
ment. Though human paleontology is by its very nature incomplete, great 
advances have been made, particularly in recent decades. The sketch that I 
present below is based largely on Eldredge and Tattersall (1982) and Pil- 
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beam (1984), whose views are widely if not universally accepted. The pur- 
pose of the following discussion of paleontology is to place the study of the 
origin and evolution of language in its proper biological context; it is not to 
defend any particular detail in the controversial area of human evolution. 

The three great apes are thought to have branched off from the evolu- 
tionary line leading to Ното sapiens in the following order: (1) orangutan (16 
million years before present [в.р.]), (2) gorilla (10 million в.р.), and (3) 
chimpanzee (7 million в.р.). The fossil record of man's ancestors from the 
period 8 million to 4 million B.P. remains a gap in our knowledge. When 
traces of mankind's ancestors reappear, in eastern Africa, around 4 million 
B.P., they are in the form of a creature who has been named Australopithecus 
afarensis. Small іп stature (3' to 412’ in height and weighing 50—100 165.) and 
small in brain size (450 cc), this ancestor shared one important trait with 
modern humans. Unlike apes and monkeys, it walked erect on two feet. So 
far as we know, it had no tools and did not use fire. 

This trait of bipedality defines a family containing man and “man-like” 
creatures, known collectively as hominids. The hominids in turn are usu- 
ally divided into two genera, Australopithecus and Homo. The earliest known 
form of Australopithecus (A. afarensis) is attested from roughly 4-3 million 
B.P. About 3 million B.P. a different variety of Australopithecus (A. africanus) 
replaced A. afarensis. А. africanus is represented in the fossil record from 
roughly 3-2 million В.Р. and then apparently split into A. robustus (which 
became extinct around 1% million years ago) and Homo habilis, which leads 
to modern man. Like the various australopithecine fossils, remains of H. 
habilis are so far known only from Africa. Descendants of H. habilis, H. erec- 
tus, are the only known human ancestors from the period 1.5—0.5 million 
B.P., and they are the first hominids found outside Africa. They were also 
our first ancestors to use fire and, with H. habilis, the first to use tools that 
have left a trace in the archaeological record. H. erectus is attested in Asia 
and Africa from roughly 1.6 million B.P. to 300,000 B.P. 

Between 400,000 and 300,000 years ago H. erectus disappeared and our 
own species, H. sapiens, appeared on the earth for the first time. An early 
type of H. sapiens is attested from 300,000 B.P. to 40,000 в.р. in Africa, Eu- 
rope, and Asia, and is usually referred to as "archaic" H. sapiens to distin- 
guish it from the type presently found on the earth (i.e. H. sapiens sapiens). 
One form of archaic H. sapiens, H. sapiens neanderthalensis, flourished in Eu- 
rope and the Near East between 160,000 в.р. and 35,000 B.P., and then sud- 
denly was replaced by H. sapiens sapiens everywhere, beginning around 
40,000 years ago. After 40,000 в.р., migrations carried mankind's ancestors 
to northern Eurasia, Australia, the Pacific Islands, and the New World. 

During the evolution from Australopithecus to Homo sapiens sapiens, there 
were a number of significant developments in both the biological and the 
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cultural realm. Size in general, and brain size in relation to body weight in 
particular, steadily increased, with most modern humans standing be- 
tween 4% and 6% feet tall and weighing between 100 and 200 Ibs. From at 
least the time of H. habilis, our ancestors used stone tools, at first primitive 
and then, with H. erectus and H. sapiens, progressively more complex and 
more varied. Big-game hunting was practiced by H. erectus, and ritual bur- 
ial is attested from more than 100,000 years ago. Some kind of cultural 
and/or biological transition appears to have taken place between 40,000 
and 30,000 years ago. "Neanderthals" suddenly disappeared, and H. sa- 
piens sapiens just as suddenly spread over the entire earth. More advanced 
kinds of tools, art, and other artifacts are attested for the first time. Less 
than 12,000 years ago agriculture was first practiced, in the Near East, and 
the transition from a nomadic hunterer-gatherer way of life to a sedentary 
agricultural society began. Finally, between 6,000 B.P. and 5,000 B.r., writ- 
ing was invented and history proper begins. 

Having thus briefly sketched the human ancestry, let us turn to the ques- 
tion of what, if anything, we can say about the linguistic capabilities of 
these various creatures. The linguistic ability of modern man (H. sapiens sa- 
piens) is well known, if not well understood, but since language leaves no 
fossil evidence, it would seem at first blush impossible to say anything at all 
about the speech (or lack thereof) of our extinct ancestors. But several at- 
tempts have nevertheless been made in recent years. Lieberman (1975) pro- 
posed that Neanderthal man, though a member of our species, had lin- 
guistic abilities that differed significantly from our own. On the basis of a 
reconstruction, using fossil evidence, of the vocal tract, Lieberman has ar- 
gued that Neanderthals would have been incapable of producing (1) vowels 
such as [i], [u], and [a], (2) velar consonants (e.g. k, е, 9), or (3) а nasal- 
nonnasal distinction (i.e. the difference between т and b, or n and d). АП 
three of these traits are characteristic of virtually every human language, as 
a perusal of the data in Volume 2 clearly shows. If Neanderthals really had 
such limitations, their speech would have been nothing like any extant hu- 
man language. The validity of Lieberman's reconstruction (made in collab- 
oration with his co-worker Edmund 5. Crelin) was called into question at 
the 1975 conference by E. Lloyd Du Brul (1976: 640), who claimed that such 
beings would not have been able to open their mouths, much less talk, and 
by Dean Falk (1976: 725), who argued that Lieberman's Neanderthals would 
not have been able to swallow either. Lieberman also claimed in 1975 that 
Neanderthal language possessed a syntax with a "transformational compo- 
nent," apparently reflecting his infatuation at that time with the transfor- 
mational model. His most recent and most comprehensive treatment of the 
origin of language (Lieberman 1984) specifically rejects some of his earlier 
positions; and if it does not offer us any definitive answers, it provides an 
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excellent summary of what is known and points squarely in the proper di- 
rection—toward biology. 

Another putative method of investigating the linguistic capabilities of 
human ancestors has been the examination of fossil endocasts, that is, fos- 
sils representing the inside of a skull, where the brain once was. Some 
scholars have claimed to be able to discern "Broca's area" or "Wernicke's 
area" (see section 7.4.3) in endocasts from australopithecines, but most 
paleobiologists consider such claims nothing more than legerdemain. Ac- 
cording to Ralph L. Holloway (1976: 330), "There is no good paleoneuro- 
logical evidence as yet from the fossil hominid record that either proves or 
disproves when this or that hominid acquired the capacity for language." 

The nature of the linguistic evolution that led to human language has 
several interpretations. On the one hand are those who maintain a strict 
Darwinian position of slow, gradual, progressive, and constant change over 
long periods of time. Others, however, believe that evolution, biological 
and linguistic, usually proceeds by swift and sharp transitions between 
stages, though any stage may persist with great stability over long periods 
of time. Contrary to Darwin's expectations, the fossil record simply does 
not support his conception of slow gradual change. A number of linguists 
have proposed that language also evolved by fits and starts, rather than by 
slow steady progression. The idea is often expressed that there were certain 
critical "thresholds" that led from one stage to another, though what those 
thresholds were, and when (and how often) they took place remain matters 
of conjecture. 

One hypothesis is that when the computing power of the brain passed 
some critical level, it became a qualitatively different instrument. Another 
proposal is that the original symbiosis of sound and meaning, the defini- 
tional quality of a word, led to an explosive growth in vocabulary. Indirect 
support for such a view has been cited from child language acquisition, 
from language learning by apes, and from Helen Keller's experience in 
learning language, in all of which a rapid explosive growth in vocabulary 
has been observed. Though there is no direct evidence, it seems likely that 
a similar phenomenon occurred at some point in the development of hu- 
man language. A third proposal, found in Bickerton (1981), is that the cru- 
cial step in the "hominization" of mankind was the transition from mono- 
propositional sentences to multipropositional sentences. Although the 
idea of certain thresholds separating qualitatively different stages is appeal- 
ing, there is really no direct evidence at present for any specific stage. As 
far as dates are concerned, the period 70,000—30,000 (or 40,000— 30,000) 
years ago is frequently cited as a time of rapid development, but what those 
developments might have been we can only speculate. 

This brings us to a final question. Do all of the world's roughly 5,000 lan- 
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guages represent the same stage of development, ог have some reached а 
higher level than others? As we saw in section 2.3, nineteenth-century lin- 
guists considered the inflectional stage of European languages more ad- 
vanced than the agglutinating stage (e.g. Turkish), which was in turn more 
advanced than the primitive isolating stage (e.g. Chinese). Modern lin- 
guists generally consider all languages to be equal, or at least at the same 
stage of development. Wang (1976) proposes that language evolved from an 
“emergent state,” during which language as we know it developed, into a 
“steady state,” where the basic linguistic structures are simply shifting 
around, rather than evolving further to a higher stage. Bickerton (1981: 
299), however, claims that “the belief that all existing languages are at the 
same level of development [has] no empirical foundation,” and I must 
agree with him on this point. Both the nineteenth-century racism that ar- 
ranged languages in an order running from primitive to advanced and the 
contemporary egalitarian view represent the prevailing ideologies of the 
different centuries more than anything else. It is not at all clear how we 
would recognize a language that was a stage ahead of (or behind) others. 
There can be no question that language developed through an emergent 
state, but whether that emergent state is over and a steady state has been 
achieved has yet to be demonstrated. Possibly the generally perceived 
steady state is rather the reflection of a long stable period of linguistic de- 
velopment, perhaps from a single origin, rather than the cessation of the 
emergent stage. 


7.4.3 The Human Brain 


During the nineteenth century it was first noticed that people whose 
brains had been damaged in certain areas, as the result of accidents, ex- 
hibited a corresponding impairment in language. Lesions to Broca's area af- 
fected the production of sounds, whereas lesions to Wernicke's area tended 
to affect the perception of sounds. Different forms of aphasia are associated 
with each area. Unfortunately, many aspects of language (e.g. the lexicon) 
defy any neat localization in the brain, and even the location of Wernicke's 
area can be specified only probabilistically. Oscar Marin (1976: 902) con- 
cludes that "it would seem premature, at this stage, to try to plug linguistic 
theory into the brain." 

It has long been known that the human brain is lateralized, that is, the 
left and right hemispheres serve different functions. In most normal adults 
language is controlled by the dominant (left) hemisphere, where both 
Broca's area and Wernicke's area are usually located. Furthermore, human 
language appears to be controlled primarily by the brain's neocortex, 
whereas the communicative abilities of lower animals seem to be controlled 
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by the limbic system. The neocortex is a much later evolutionary develop- 
ment than the limbic system, though it is by no means restricted to man. 

Several scholars have attempted to use our meager understanding of the 
brain to approach the problem of language origins. Claims that Broca's area 
can be discerned in australopithecine endocasts is one example, though in 
all likelihood a purely fanciful one. Others have attempted to correlate 
brain weight (especially in relationship to body weight) with the evolution 
of language, but it is now generally thought that brain structure is more 
important than brain size alone. 

Brain lateralization has also been adduced as a potential clue to the ori- 
gin of language. The fact that both "handedness" and human language are 
intimately tied up with lateralization has been interpreted by some as sup- 
port for the "gestural" origin of language. The vast majority of people are 
right-handed, and it is the left hemisphere that controls the right hand, just 
as it is the left hemisphere that controls language (in most people). Further- 
more, brain lateralization apparently also existed in Horno erectus over a half 
million years ago, as indicated by the stone tools associated with the Chi- 
nese variety of H. erectus, which display right-handed chipping. Since it 
now appears that chimpanzees (but not monkeys) also exhibit some degree 
of lateralization, this trait may be a necessary criterion for human speech, 
but not a sufficient one. 

According to Marin, Saffran, and Schwartz (1976: 868—69), “Тһе human 
brain is a vastly complex instrument; its operating principles with respect 
to language, and indeed, to behavior in general are practically unknown." 
As long as this remains the case it is unlikely that a study of the brain will be 
able to shed much light on the origin of language. 


7.4.4 Child Language Acquisition 


The German biologist Ernst Haeckel noted in the nineteenth century 
that "ontogeny recapitulates phylogeny," that is, the development of an in- 
dividual reflects, imperfectly and incompletely perhaps, the evolutionary 
development of the species to which the individual belongs. Lamendella 
(1976) explored the possibility that the development of language in the 
child, which has received considerable attention since the 1960's, may well 
parallel in some measure the evolution of language in the human species. 
Bickerton (1981) pursues the same line of reasoning. Since we now have at 
least some understanding of the different stages in child language develop- 
ment, we may speculate that these stages of development also reflect to 
some degree the linguistic history of our species. Lamendella's hypothesis 
is in my opinion highly plausible, but there is really no solid evidence to 
support it at the moment. 
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Despite the enormous advances in our understanding of the human spe- 
cies made during the past century, all this new information sheds very little 
direct light on the origin and evolution of language. Though we may have 
more informed speculation than scholars of earlier times, we have no hard 
answers to the fundamental questions. We do not know when language de- 
veloped, or even whether the time span involved should be measured in 
tens of thousands or in millions of years. Nor do we know how it devel- 
oped, whether gradually and constantly over a long period of time, or epi- 
sodically with sharp transitions from stage to stage. Finally, why it devel- 
oped when it did (whenever that was) is perhaps the biggest enigma of all. 
Arguments that it was developed when it was needed for some specific pur- 
pose (e.g. big-game hunting, toolmaking, societal organization) strike me 
as quite implausible. Language is an all-purpose tool; it is not a can opener. 
It is, as Alfred Russel Wallace suggested in 1869, "an instrument. . . devel- 
oped in advance of the needs of its possessor" (quoted in Wang 1976: 62). 
More likely the evolution of language is to be interpreted as an extension of 
the general cognitive abilities of our ancestors, the adventitious coupling of 
sound and meaning. The “vocal tract,” it should not be forgotten, is an as- 
semblage of organs that evolved with, and still serve, other functions (e.g. 
breathing, eating). Their use in language is clearly secondary. Similarly, we 
must assume considerable cognitive preadaptation, for language consists of 
more than sounds. 

At the 1975 conference on language origins, Ralph Holloway (1976: 330) 
aptly summarized the current state of our knowledge, when, after first la- 
menting that perhaps the French were right in banning any discussion of 
the origin of language, he wrote that "for me, at least, the very fact that this 
Conference has been organized, and that human animals are ready to en- 
gage in a great 'garrulity' over the merits and demerits of essentially un- 
provable hypotheses, is an exciting testimony to the gap between human 
and other animals." 
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Genetic Classification of 
the World's Languages 


This chapter offers a genetic classification of all the world's extant languages 
(and many of the extinct ones), as best it can be determined from available 
evidence. The opening section (8.1) discusses problems related to the nam- 
ing of languages and groups. Section 8.2 presents the principles and con- 
ventions I have used in arranging the languages and groups in the classifi- 
cation itself (8.6). Section 8.3 offers an overview of the world's language 
phyla. Sections 8.4, a general index, and 8.5, a skeleton outline of the major 
genetic groups (with exemplary languages), provide successive approxima- 
tions to the complete classification and are intended to help readers orient 
themselves in the overall classification, where it is easy to become lost. The 
number following each group in these two indexes indicates the page on 
which that group begins in the complete classification. 

The reader will notice that the taxonomic ambiguities, disputes, and un- 
certainties emphasized in Chapters 2-6 do not figure in the final classifica- 
tion, which is presented as if all problems had been resolved. Readers who 
wish to know which portions of the classification are firmly established, 
and which controversial, should consult the corresponding sections in 
the text. 

The sources used in compiling the classification have been listed at the 
ends of sections in Chapters 2-6 and in the section Specialists Consulted, 
but a brief word on how the classification arrived at its present state may be 
instructive. The original classification was put together in the mid-1970's 
and was based almost entirely on Voegelin and Voegelin (1977). In late 1979 
I became aware that certain parts of the Voegelins' book (which reflected the 
state of linguistic taxonomy around 1970) had already been superseded by 
more recent publications, and I began to update the classification to take 
account of this work. 
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Іп 1981 I wrote to some 200 scholars, who I knew from their publications 
were interested in taxonomy (most linguists are not), and requested their 
advice on the portions of the classification in which they had erpertise. 
More than 100 of them responded to my request, often with exquisitely 
long and detailed letters suggesting major revisions, at other times briefly 
on questions of detail. The vast majority of their suggestions were incorpo- 
rated into the classification, which has been kept on computer since 1981 to 
facilitate its revision. At the same time I undertook a more thorough exami- 
nation of the taxonomic literature and discovered a wealth of information 
on various groups that was also incorporated into the classification. 

Yet the final classification does not always directly reflect the sources and 
the counsel of the experts, since for many, perhaps most, of the families (at 
whatever level), I had collected more than one classification. I had had a 
complete classification to start with (i.e. Voegelin & Voegelin 1977). Grimes 
(1978) provided another complete classification, though it was based partly 
on the Voegelins' book. In addition to these two complete classifications I 
collected many other classifications, of groups large and small, from the lin- 
guistic literature and from my own correspondence. So when it came time 
to decide what the classification of a group should be, I frequently had 
a half-dozen different classifications of that group. Where all, or most, 
agreed, there was no problem, but when they differed, as they often did on 
details and sometimes on major points, reconciliation was required. Al- 
though it might seem safest in such instances to choose one of the classifi- 
cations and follow it faithfully, I quickly realized that this was not the best 
tactic, because each of the classifications had its own strengths and weak- 
nesses. Limiting myself to one person's classification almost invariably en- 
tailed adopting parts of a classification that were clearly inferior to someone 
else's. Í therefore attempted to incorporate into the classification, when 
there were conflicts between sources, what seemed to me to be the strengths 
of each classification, always taking into account such factors as who had 
produced a classification, how it had been produced, on what it was based, 
and when it was done. The reconciliation of conflicting classifications is a 
delicate task, at which I have, no doubt, not always been successful. But 
because classifications continually evolve, particularly when the languages 
are poorly known and little studied, the classification offered in section 8.6 
should be considered provisional in many regards and not the final word. It 
is intended to stimulate further research, not to put an end to it. 


8.1 NAMES 


To most people, language names appear straightforward. The English 
speak English, the French French, and the Chinese Chinese. If Americans, 
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Canadians, Australians, and New Zealanders also speak English, they still 
call it English, and it is clearly the same language, with relatively slight dia- 
lectal variation. Yet this simple view of language names is really valid only 
for "famous languages," those languages everyone has at least heard of 
(e.g. English, Chinese, Japanese, Eskimo), and even here the names often 
conceal problems the layman is not aware of. For all but a few hundred of 
the world's roughly 5,000 languages, the names are known only to spe- 
cialists, the people themselves, and their neighbors, all three of whom fre- 
quently have different names for the language in question. In addition, the 
naming problem is intimately tied up with the language/dialect problem 
discussed in section 1.2. In much of the world, then, language names do 
not correspond to neatly defined groups of mutually intelligible dialects. It 
is common, in fact, for mutually intelligible dialects to be regarded as dif- 
ferent languages with distinct names, whether for national (e.g. Swedish, 
Danish, Norwegian), religious (e.g. Hindi, Urdu), or other political and 
cultural reasons (e.g. Czech, Slovak). It is equally common for a single 
language name to subsume a number of very divergent dialects, not all 
of which are mutually comprehensible (e.g. Romany, Saami [= Lapp], 
Nahuatl, Quechua). And in between these two extremes there is a multi- 
tude of cases where different varieties of speech are considered one lan- 
guage by some, two or more by others (e.g., the two major dialects of Alba- 
nian, Gheg and Tosk, which are sometimes accorded the status of distinct 
languages). 

It thus often depends on nonlinguistic factors whether distinct varieties 
of speech are subsumed under one name or given separate names. In this 
regard linguistic taxonomy is no different from biological taxonomy, where 
the species/variety distinction shares many of the same problems as the 
language/ dialect distinction. Dogs and wolves are considered separate spe- 
cies (Canis familiaris and Canis lupus, respectively) even though they may 
interbreed and produce viable offspring. On the other hand, some varieties 
of dogs (e.g. Chihuahuas and Great Danes) do not interbreed and yet are 
still assigned to the same species. To some extent the naming conventions 
in both biology and linguistics are a matter of convenience and are affected 
at the lowest levels of classification by nonbiological and nonlinguistic 
considerations. 

In general, the names adopted in this volume reflect either those of the 
sources used in compiling the classification or those recommended to me 
by the scholars with whom І corresponded. Іп most cases I have not tried 
to iron out the taxonomic inconsistencies mentioned above; Swedish, 
Danish, and Norwegian are listed as three distinct languages, whereas 
Nahuatl is treated as one. The endpoints in the classification thus approxi- 
mate distinct languages, but both overdifferentiation and underdifferentia- 
tion are present in some degree. 
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Table 8.1. Pejorative Language Names 














Pejorative name Preferred Pejorative name Preferred 
(language group) name (language group) name 
Alacaluf (Andean) Oawasgar Hottentot (Khoisan) Nama (or Khoe) 
Auca (Andean) Sabela Igabo (Edo) Isoko 
Barya (East Sudanic) Nera Ingalik (Athabaskan) None known 
Batangan (Meso- Kafir (Indo-Iranian) Nuristani 
Philippine) Taubuid Lapp (Finno-Ugric) Saami 
Beriberi (Saharan) Kanuri Matakam (Chadic) Mafa 
Boni (Cushitic) Aweer Mikeyir (East Sudanic) Shabo 
Buzi (Mande) Loma Namshi (Adamawa) Doyayo 
Chama (Tacana) Ese'ejja Shosho (Plateau) Birom 
Dagada (Timor-Alor- Tagabili (South Mindanao)  Tboli 
Pantar) Fataluku Yaghan (Andean) Yamana 
Galla (Cushitic) Oromo Zumper (Jukunoid) Kuteb 


Not infrequently a language is known by more than one name (e.g. Irish 
and Gaelic, Persian and Farsi), and in such common cases I have had to 
make a choice, though both terms are found in the language index at the 
end of this volume. In most cases I have tried to adopt the most widely 
used term or the term preferred by the people who actually speak the lan- 
guage. Unfortunately, these two criteria are often in conflict, and there is a 
growing movement, which I have followed in many cases, to replace the 
older, better-established term with the name the people themselves use. In 
many such cases the older terms are still listed in the language index to this 
volume to facilitate finding the current equivalent. I have not, however, at- 
tempted to provide a complete list of all the language and dialect names 
and their variant spellings; for these the reader is referred to Voegelin and 
Voegelin (1977) and Grimes (1984a,b). 

In a number of cases the most widely used name is felt by the people 
who speak the language to be pejorative, and in such cases I have re- 
placed that name with a non-pejorative one. Table 8.1 shows the pejora- 
tive language names that have so far come to my attention—with their 
replacements. 

When faced with the problem of selecting a language name for a group 
of mutually intelligible dialects, each with its own name, different linguists 
follow different strategies. One strategy is simply to list the separately 
named dialects as if they were distinct languages (e.g. Swedish, Danish, 
Norwegian). Another is to "invent" a name that subsumes the dialects in 
question (e.g. Scandinavian). A third is to use hyphenated names (e.g. 
Swedish-Danish-Norwegian). Finally, one may choose the name of one dia- 
lect (usually the dialect with the most speakers, the highest prestige, or the 
greatest intelligibility with other dialects) as the language name. I have 
pointed out that some cases of overdifferentiation (the first strategy) have 


8.2 Taxonomic Conventions 279 


been allowed to persist іп the classification. Іп other cases it has seemed 
advisable to list dialects under a single name, and in such cases the last 
strategy has been preferred. I have avoided hyphenated names (the third 
strategy) almost entirely, because they are unwieldy and because they are 
likely to be confused with family names. Invented names (the second strat- 
egy) have been used sparingly, and none of the inventions are of my own 
making. This means that certain dialects, even a few that some people 
might consider separate languages, will not be found in the classification. 
Some of these dialect names are listed in the language іпдех, with ап іп- 
dication of the language they are considered to belong to. For those not 
found in the language index, the reader is advised to consult Voegelin and 
Voegelin (1977) and Grimes (1984a,b), both of which may be used as in- 
dexes to this volume. 

Names for language groups have somewhat different problems. In con- 
trast to biology, where a group is named by its discoverer and everyone else 
is expected to respect that name, in linguistics different scholars often pro- 
pose different names for the same group. Sometimes a name formerly used 
fora group becomes unsuitable and is replaced by another (e.g. since around 
1950 Hamito-Semitic has been increasingly replaced by Afro-Asiatic; see 
sections 3.1 and 3.2). In other cases different family names imply different 
subgrouping hypotheses (e.g. Indo-Hittite vs. Indo-European; see section 
2.3). Some presumed families are later found not to be valid genetic nodes, 
and their names disappear from the classification altogether (e.g. Chari- 
Nile and Kwa; see sections 3.3 and 3.4). Generally, I have tried to use the 
family names currently used by specialists; Indo-Hittite thus represents a 
deviation, for Indo-European is by far the more common term today. I have 
throughout preferred hyphenated family names to their unhyphenated 
variants (e.g. Indo-European, Afro-Asiatic vs. Indoeuropean, Afroasiatic; 
Austroasiatic, rather than Austro-Asiatic, is not an exception to this rule 
since "Austro-" here means "southern," and thus the meaning of the name 
is "Southern Asiatic"). 


8.2 TAXONOMIC CONVENTIONS 


The reader who examines the complete classification in section 8.6 will 
notice that the phyla, families, and languages are sometimes arranged in an 
order different from that followed in Chapters 2-6. This is because I have 
adopted a didactic, and largely historical, perspective in the text, but the 
classification itself follows other principles. This section will explain ex- 
actly what those principles and conventions are. 

Not only do linguists have different methodological biases in matters of 
taxonomy (e.g. lexicostatistics vs. traditional comparative approaches), 
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they also have different methods of representing the genetic relationships 
schematically. Since the present work reguires a uniform format for all the 
worlds languages, I have devised one that is intended to maximize read- 
ability while expressing the genetic relationships as precisely as possible. 
All of the classifications collected from the literature, or sent me by col- 
leagues, have been translated into this framework. 

In simplest terms, the underlying principle in arranging the groups and 
languages in the classification is that phyla, stocks, families, and languages 
(i.e. all genetic units at whatever level) should be ordered so that each is 
preceded and followed in the classification by the phyla, stocks, families, or 
languages to which they are most closely related. There are cases where 
this principle cannot be applied with total consistency because a language 
or group appears to be equally closely related to more than two languages 
or groups. But the principle nonetheless has great heuristic value and is 
useful if its limitations are not overlooked. 

Unfortunately, genetic relationships are not always expressed with total 
precision, and it is often unclear in the literature whether the ordering of 
the languages is significant or not—and if it is, what that significance is. 
Since the genetic relationships themselves are so frequently underspecified, 
I have followed a hierarchy of principles in compiling the classification, 
which I shall discuss in their order of precedence: 

]. Where the information is sufficiently precise I have followed strict ge- 
netic principles. Languages and groups are listed in an order that puts them 
closest to the languages or groups to which they are most closely related. 

2. In the absence of knowledge of the specific genetic relationships, I 
have often followed a geographic arrangement (e.g. north to south, or east 
to west). The ordering of the 17 phyla is roughly geographic, starting 
in Southern Africa (Khoisan) and proceeding around the world to the 
tip of South America (Amerind). Even at this level genetic considerations 
are not completely absent: the five constituents of Greenberg's Eurasiatic 
family (i.e. Indo-Hittite, Uralic-Yukaghir, Altaic, Chukchi-Kamchatkan, 
and Eskimo-Aleut) are listed consecutively, in anticipation of further con- 
solidation among these groups. 

3. In a good many instances I have simply followed the order of presen- 
tation in the source, without knowing its precise basis. 

4. Where it does not violate any known genetic relationships I have tried 
to let small groups precede large groups. This increases readability, since it 
produces taxonomic structures that branch continuously to the right, 
making orientation easier within the classification, especially from page to 
page. In some cases genetic considerations preclude this, and in these rare 
instances the numbering next to each group (e.g. II, E, 6, d) should allow 
the reader to identify taxonomic units at the same level, even if they occur 
on different pages. 


FAMILY X 





Fig. 8.1. Branching-tree diagram of family X 


List 8.1. Representation of the Branching-Tree 
Diagram of Family X 


FAMILY X: 
I PRIMARY BRANCH 
H PRIMARY BRANCH 
HI PRIMARY BRANCH: 
A GROUP 
B GROUP: 
1 SUBGROUP 
2 SUBGROUP 
3 SUBGROUP: 
a BRANCH 
b ВКАХСН: 
i SUBBRANCH 
ii SUBBRANCH: 
a GROUP 
B GROUP 
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Тһе formalism used to represent genetic trees is explained briefly іп the 
introductory “Note on Notation.” As stated there, the general principle is 
that branching trees such as that in Fig. 8.1 are given the representation 
shown іп List 8.1. Each branch is preceded by a letter or number that identi- 
fies the taxonomic level of the group. The order in which these identifying 
tags are used is capital Roman numerals (LI), capital letters (A,B), Arabic 
numerals (1,2), lower-case letters (a,b), small Roman numerals (1,1), and 
lower-case Greek letters (о, В). If branching continues beyond six levels, the 
numbers and letters start over, beginning with capital Roman numerals. 

The schema illustrated in List 8.1 is sometimes modified by the following 
abbreviatory convention. Individual languages that are coordinate with 
groups of languages (i.e. that are at the same hierarchical level) are listed 
first in the classification, preceded by their quantity. This simply means that 


LAMOGAI [6]: 
[3]: Longa, Idne, Mok 
] RAUTO [3]: Lamogai, Pulie, Rauto 


is a shorter way of expressing: 


LAMOGAI [6]: 

1 LONGA [1]: Longa 

2 IDNE [1]: Idne 

3 MOK [1]: Mok 

4 RAUTO [3]: Lamogai, Pulie, Rauto 


Languages listed in this special initial position fall into three classes: (1) 
languages that are known to form an independent group within a higher- 
level family (e.g. Catawba within Siouan), (2) languages that do not seem to 
belong to any of the groups at a particular level, but whose independent 
position is perhaps due to a lack of information on the language in question 
rather than to its proven isolation from the other languages and groups 
(e.g. the 14 languages listed at the beginning of the Australian phylum), or 
(3) languages that are the parent of the group that follows (e.g. Latin and 
Romance). The third category is clearly distinct from the first two, and in 
fact is used sparingly. It is, however, often impossible to distinguish the 
first two categories in practice, even though they may be theoretically dis- 
tinct. If an individual language "fits," taxonomically, between two groups 
of languages, the abbreviatory convention is not used, for its use would dis- 
tort the correct genetic structure. This abbreviatory convention is also not 
used in certain cases where I wish to emphasize the known isolated nature 
of the language (e.g. Albanian and Armenian within Indo-European). In 
such cases the failure to use the convention is simply a stylistic variant and 
should be accorded no special importance. 
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8.3 OVERVIEW OF LANGUAGE PHYLA 


Map 8.1 shows the general geographic distribution of the world's lan- 
guage families. Table 8.2 provides a synopsis of the world's language phyla, 
offering ап indication of the number of languages in each group, the num- 
ber of speakers in each group, the geographic location of the phyla, and the 
better-known languages within each group. Estimates of the number of 
speakers of languages and language groups are notoriously difficult to 
make with confidence in many cases. І have relied most heavily оп Voegelin 
and Voegelin (1977), Grimes (1984a), and for Oceania, Wurm and Hattori 
(1981-83). 


Grimes, Barbara Е., ed. 1978. Ethnologue, 9th ed. Dallas. 

. 1984а. Ethnologue, 10th ed. Dallas. 

. 1984b. Index. Dallas. 

Voegelin, C. F., and F. M. Voegelin. 1977. Classification and Index of the World's Lan- 
guages. New York. 

Wurm, Stephen А., and Shiró Hattori, eds. 1981-83. Language Atlas of the Pacific 
Area, 2 vols. Stuttgart. 
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Table 8.2. Overview of Language Phyla 





Number of 
ertant 
Phylum languages 

KHOISAN 31 
NIGER-KORDOFANIAN 1,064 
NILO-SAHARAN 138 
AFRO-ASIATIC: 241 
tAncient Egyptian — 
Berber 30 
Chadic 123 
Omotic 34 
Cushitic 35 
Semitic 19 
CAUCASIAN 38 
INDO-HITTITE: 144 
TAnatolian = 
Armenian 1 
+Тосһапап — 
Indo-Iranian 93 
Albanian 1 
Greek 2 


Number of 
speakers 


120,000 
181,000,000 


11,000,000 
175,000,000 


11,000,000 
30,000,000 
1,000,000 
12,000,000 
121,000,000 
5,000,000 

2 billion 
5,000,000 
700,000,000 


4,000,000 
10,000,000 


Location or country 


South Africa, Namibia, 5 Angola, 
Botswana, N Tanzania 
С апа 5 Africa 


С Africa 

М Africa, Near East 

Egypt 

Algeria, Morocco, Tunisia, Libya, 
Mauritania, Senegal 

Chad, Niger, Ghana, Nigeria, Cam- 
eroon, Central African Republic, 
Togo, Benin 

W Ethiopia, N Kenya 

Somalia, Ethiopia, Sudan, Kenya, 
Tanzania 

М Africa, Near East 


Caucasus (U.S.S.R.) 

Europe, SW Asia, India, Americas, Aus- 
tralia, South Africa, New Zealand 

Turkey 

U.S.S.R. 

W China 

Iran, Afghanistan, Pakistan, India 


Albania 
Greece 


Better-known languages 
Nama (= Hottentot) 


Fula (= Fulani), Mandinka, Yoruba, 
Igbo, Swahili, Kongo, Luganda, 
Rwanda, Shona, Tswana, Xhosa, 
Zulu 

Kanuri, Luo, Nubian, Maasai, Songhai 


ТАпсіепі Egyptian 
Shilha, Kabyle, Riff, Tuareg, Tamazight 


Hausa 


Ometo 
Somali, Oromo 


Arabic, Hebrew, Aramaic, Amharic, 
Tigrinya 
Georgian 


THittite 

Armenian 

TTocharian 

Romany, Farsi (= Persian), Kurdish, 
Pashto, Punjabi, Gujarati, Hindi- 
Urdu, Marathi, Bengali 

Albanian 

Greek 


Zqc 


Italic 


Celtic 
Germanic 


Balto-Slavic 


URALIC-YUKAGHIR 


ALTAIC: 
Turkic 


Mongolian 

Tungus 

Korean 
Japanese-Ryukyuan 
Ainu 


CHUKCHI-KAMCHATKAN 
ESKIMO-ALEUT 


ELAMO-DRAVIDIAN 
SINO-TIBETAN 


AUSTRIC: 
Miao-Yao 
Austroasiatic: 

Munda 
Mon-Khmer 


Daic 


16 


15 


24 


63 
31 


28 
258 


1,175 


155 
138 
57 


500,000,000 


2,500,000 
450,000,000 


290,000,000 


22,000,000 


250,000,000 
80,000,000 


3,000,000 
80,000 
55,000,000 
115,000,000 
few 

23,000 
85,000 


145,000,000 
1 billion 


293,000,000 
7,000,000 


56,000,000 
6,000,000 
50,000,000 


50,000,000 


Rumania, Italy, France, Spain, Portugal, 
Central and South America 


Ireland, Wales, N France 

Germany, Holland, Scandinavia, Great 
Britain, North America, Australia, 
New Zealand, South Africa 

U.S.S.R., Poland, Czechoslovakia, 
Yugoslavia, Bulgaria 


Finland, Estonia, Hungary, U.S.S.R. 


Asia 
Turkey, U.S.S.R., Iran 


Mongolia, China, U.S.S.R. 

E U.S.S.R., China 

Korea 

Japan 

N Japan, S Sakhalin Island (U.S.S.R.) 

NE Siberia (U.S.S.R.) 

Alaska, N Canada, Greenland, 

NE U.S.S.R. 

S and E India, S Pakistan 

China, Tibet, Nepal, India, Burma, 
Thailand, Laos 

SE Asia, Oceania 

S China, N Vietnam, N Laos, 

N Thailand 

NE India, SE Asia 

NE India 

NE India, SE Asia, Nicobar Islands 


S China, SE Asia 


tLatin, Rumanian, Italian, French, 
Provencal, Catalan, Spanish, 
Portuguese 

Irish, Welsh, Breton 

German, Yiddish, Dutch, Afrikaans, 
English, Danish, Swedish, Nor- 
wegian, Icelandic 

Lithuanian, Latvian, Russian, Ukrai- 
nian, Byelorussian, Polish, Czech, 
Slovak, Serbo-Croatian, Bulgarian 

Hungarian, Finnish, Saami (= Гарр), 
Estonian 


Turkish, Uzbek, Uighur, Azerbaijani, 
Turkmen, Tatar, Kazakh, Kirghiz, 
Chuvash, Bashkir 

Khalkha (= Mongolian) 

Manchu, Evenki 

Korean 

Japanese 

Ainu 

Chukchi 

Eskimo, Aleut 


Telugu, Kannada, Tamil, Malayalam 
Mandarin, Wu, Yue (= Cantonese), 
Tibetan, Burmese, Karen 


Miao, Mien (= Yao) 


Santali, Mundari 

Vietnamese, Mon, Khmer 
(= Cambodian) 

Thai (= Siamese), Lao (= Laotian), Li, 
Shan, Zhuang, Kam 


Continued 
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Table 8.2. Overview of Language Phyla (continued) 


Number of 
extant 

Phylum Janguages 
Austronesian: 959 
Western 533 
Eastern (= Oceanic) 426 
INDO-PACIFIC 731 
AUSTRALIAN 170 
NA-DENE 34 
AMERIND: 583 
Kutenai 1 
Algic: 16 
Ritwan 1 
Algonguian 15 
Mosan: 27 
Chimakuan 1 
Wakashan 6 
Salish 20 
Keresan 2 
Siouan-Yuchi 11 
Caddoan 4 
Iroquoian 7 
Penutian 68 


Number of 
speakers 


180,000,000 
179,000,000 


1,500,000 
2,735,000 


30,000 
202,000 


18,000,000 
200 (in 1977) 
91,000 

10 (in 1980) 
91,000 
9,500 

10 (in 1977) 
2,700 

6,800 

7,000 
21,000 
1,000 
15,000 
3,200,000 


Location or country 


Oceania, S Vietnam, Madagascar 
Madagascar, Formosa, Indonesia, 
Philippines, 5 Vietnam, Kampuchea 


Melanesia, Micronesia, Polynesia 

New Guinea, Timor, Alor, Pantar, 
Halmahera, New Britain, New 
Ireland, Bougainville, Solomons, Reef 
Islands, Santa Cruz, Andaman Is- 
lands, Tasmania 

Australia 

Alaska, W Canada, Oregon, California, 
Arizona, New Mexico 

North, Central, and South America 

Montana, Idaho 

Canada, U.S. 

N California 

C and E Canada, C and E U.S. 

NW U.S., SW Canada 

NW Washington 

SW Canada 

SW Canada, NW U.S. 

New Mexico 

C U.S. 

C US. 

E U.S. 

W Canada, W and SE U.S., 5 Mexico 


Better-known languages 


Malagasy, Javanese, Sundanese, Malay, 
Tagalog, Cebuano, Ilokano, Hili- 
gaynon 

Fijian, Samoan, Tahitian, Hawaiian 

tTasmanian, Enga, Wantoat, Telefol, Jat- 
mul, Asmat 


Western Desert Language 
Navajo, Apache 


Kutenal 


Yurok 
Ojibwa, Cree, Blackfoot, Cheyenne 


Quileute 

Nootka, Kwakwala (= Kwakiutl) 

Shuswap, Kalispel, Squamish 

Keres 

Dakota, Crow 

Wichita, Pawnee 

Cherokee, Mohawk 

Chinook, Zuni, Muskogee (= Creek), 
Quiche, Cakchiquel, Kekchi, Mam, 
Yucatec 


697 


Hokan 


Tanoan 
Uto-Aztecan 
Oto-Manguean 
Chibchan-Paezan 


Andean 


Macro-Tucanoan 
Eguatorial 
Macro-Carib 
Macro-Panoan 
Macro-Ge 


55,000 


7,400 
1,100,000 
1,700,000 

200,000 


8,500,000 


35,000 
3,000,000 
50,000 
50,000 
10,000 


California, Arizona, Texas, Mexico, 
Colombia 

New Mexico, Oklahoma 

W U.S., Mexico 

5 Mexico 

Florida, S Mexico, Central America, 

W South America 

W South America 


NW and E South America 
South America, Caribbean 
N South America 

W South America 

E South America 


Mohave, Yuma, Tlapanec, Tequistlatec 


Kiowa, Tewa 
Comanche, Hopi, Nahuatl (= Aztec) 
Otomi, Mixtec, Zapotec 
Tarascan, Yanomami, Guaymi, Cuna, 
Paez, Warao, Embera, Cayapa 
Quechua, Aymara, Mapudungu 
(7 Mapuche) 
Ticuna, Tucano, Nambikuara, Puinave 
Guarani, Tupi, Goajiro, Arawak 
Galibi (7 Carib), Witoto 
Toba, Tacana 
Bororo, Chavante 
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8.4 INDEX TO THE CLASSIFICATION 


The following is simply a list of the largest groupings—often called 
phyla—into which linguists place the world's languages. It serves both as a 
quick overview of the major language groups presented in the subsequent 
10 pages and as a page-number index to the complete classification begin- 
ning on p. 301. The first 17 numbered groups span the proposed classifica- 
tion of the world's languages. Although great bodies of research, pursued 
over centuries, have studied relationships within these 17 phyla, few se- 
rious attempts have been made to proposed relationships between them. Be- 
cause the assemblage given as 13, Austric, will be unfamiliar and uncon- 
ventional to many workers, its proposed component groups are given here. 

The final four groupings in the list are of languages that for various rea- 
sons cannot be placed in the worldwide classification: 18, Language Iso- 
lates, lists a handful of languages, such as Basque, that have no obvious 
relatives anywhere in the world; 19, Unclassified, lists a handful of poorly 
documented languages that have not yet been assigned by linguists to par- 
ticular major groups; 20, Pidgins and Creoles, lists a number of languages 
derived from short- or long-term mixing of one language with one or more 
others, typically as a result of maritime trade or slaveholding; and 21, In- 
vented, lists the major attempts, like Esperanto, to invent languages from 
the ground up, for purposes of impartial international communication. 


1 KHOISAN 301 13 AUSTRIC 334 

2 NIGER-KORDOFANIAN 302 I MIAO-YAO 334 

3 NILO-SAHARAN 317 II AUSTROASIATIC 534 

4 AFRO-ASIATIC 320 III AUSTRO-TAI 337 

5 CAUCASIAN 324 A DAIC 337 

6 INDO-HITTITE 325 B AUSTRONESIAN 338 
7 URALIC-YUKAGHIR 328 14 INDO-PACIFIC 354 

8 ALTAIC 328 15 AUSTRALIAN 362 

9 CHUKCHI-KAMCHATKAN 329 16 NA-DENE 365 

10 ESKIMO-ALEUT 2329 17 AMERIND 366 

11 ELAMO-DRAVIDIAN 330 18 LANGUAGEISOLATES 377 
12 SINO-TIBETAN 331 19 UNCLASSIFIED 377 


20 PIDGINS AND CREOLES 377 
21 INVENTED 378 


8.5 Major Language Groups 


8.5 MAJOR LANGUAGE GROUPS 


The following is an outline of section 8.6, Complete Classification. Lan- 
guages are given іп capital-and-lower-case letters, language groups in all 
capitals. Languages to the right of colons are examples from the language 
group to the left of the colon. Languages and language groups preceded by 


a dagger are extinct. 


1 KHOISAN 301 
I Hadza 
H Sandawe 
III SOUTHERN AFRICA: 
A NORTHERN: Qxà (= !Kung), Maligo 
B CENTRAL: Nama (= Hottentot), G//abake 
C SOUTHERN: £Há, N/huki 


2 NIGER-KORDOFANIAN 302 
I KORDOFANIAN: Кадир, Rere 
II NIGER-CONGO: 
A MANDE: Maninka, Kpelle, Mende 
B NIGER-CONGO PROPER: 
1 WEST ATLANTIC: Fula (= Fulani), Wolof 
2 CENTRAL NIGER-CONGO: 
a NORTH CENTRAL NIGER-CONGO: 
i KRU: Bete 
ii Dogon 
iii СОК (= VOLTAIC): Dagbani, More, Senari 
iv ADAMAWA-UBANGIAN: Mumuye, Gbaya 
b SOUTH CENTRAL NIGER-CONGO: 
i IJO-DEFAKA: Kalabiri 
ii WESTERN: 
a NYO: Akan, Kposo, Ga 
p TOGO: Ewe 
iii EASTERN: 
о CENTRAL NIGER: Nupe 
В YORUBA-NORTHERN AKOKO: Yoruba 
y EDO: Bini 
8 LOWER NIGER: Igbo 
е JUKUNOID: Jukun 
t DELTA CROSS: Abua 
т LOWER CROSS: Efik 
0 UPPER CROSS: Legbo 
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tı BENUE-ZAMBESI: 
I CARA: Kambari 
П NYIMA: 
A PLATEAU: Birom 
B WEL: 
1 BENDI-BOKYI: Bendi 
2 BANTOID: 
a NON-BANTU: 
i MAMBILA-VUTE: Mambila 
ii TIV-BATU: Tiv 
b BROAD BANTU: 
i BANE: 
о JARAWAN-EKOID: Jarawa 
В МАМЕЕ: Esimbi 
y GRASSFIELDS: Bamun 
Š MENCHUM: Befang 
e BEBOID: Noni 
¿ TIKAR: Tikar 
ii NARROW BANTU: 
о NORTHWEST BANTU: 
Losengo 
B CENTRAL BANTU: Swahili, 
Rwanda, Rundi, Shona, 
Xhosa, Zulu 


3 NILO-SAHARAN 317 
I Songhai 
II SAHARAN: Kanuri 
HI MABAN: Mesalit 
IV FUR: Fur 
V EAST SUDANIC: 
A EASTERN: 
1 NUBIAN: Mahas 
2 SURMA: Me'en 
3 Nera 
4 EASTERN JEBEL: Gaam 
B WESTERN: 
1 NYIMANG: Nyimang 
2 TEMEIN: Ronge 
3 TAMA: Merarit 
4 DAJU: Sila 
C NILOTIC: 
1 WESTERN: Luo, Nuer 
2 EASTERN: Maasai 
3 SOUTHERN: Nandi 
D KULIAK: Ik 
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VI CENTRAL SUDANIC: 
A WEST CENTRAL: 
1 KRESH: Kresh 
2 BONGO-BAGIRMI: Gambai 
B EAST CENTRAL: 
1 MORU-MADI: Logbara 
2 MANGBETU-ASUA: Mangbetu 
3 MANGBUTU-EFE: Mamvu 
4 Balendru 
VII Berta 
VIII Kunama 
IX KOMUZ: 
A GUMUZ: Bega 
B KOMAN: Komo 


4 AFRO-ASIATIC 320 
I tAncient Egyptian 
II BERBER: Tamasheq, Kabyle, Shilha 
Ш CHADIC: Hausa, Margi 
IV OMOTIC: Ometo 
V CUSHITIC: Beja, Sidamo, Oromo, Somali 
VI SEMITIC: Aramaic, Hebrew, Arabic, Tigrinya, Amharic 


5 CAUCASIAN 324 
I SOUTH: Georgian 
II NORTH: 
A NORTHWEST: Kabardian, Abxaz 
B NORTHEAST: 
1 NAX: Chechen 
2 DAGESTAN: Avar, Dargwa 


6 INDO-HITTITE 325 
I tANATOLIAN: tHittite 
H INDO-EUROPEAN: 
A Armenian 
В TTOCHARIAN: tTocharian 
C INDO-IRANIAN: 
1 INDIC: Romany, Sinhalese, Hindi, Urdu, Punjabi, Bengali, Gujarati, 
Marathi, Shina 
2 NURISTANI: Kati 
3 IRANIAN: Farsi (= Persian), Pashto, Kurdish, Baluchi 
D Albanian 
E GREEK: Greek 
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F ITALIC: 
1 TOSCO-UMBRIAN: tOscan 
2 LATINO-FALISCAN: Sardinian, Rumanian, Italian, French, 

Provencal, Catalan, Spanish, Portuguese 
G CELTIC: Irish (= Gaelic), Breton, Welsh 
H GERMANIC: English, German, Yiddish, Dutch, Afrikaans, Danish, 
Swedish, Norwegian, Icelandic 

I BALTO-SLAVIC: 
1 BALTIC: Latvian, Lithuanian 
2 SLAVIC: Russian, Polish, Czech, Serbo-Croatian, Bulgarian 


7 URALIC-YUKAGHIR 328 
I YUKAGHIR: Yukaghir 
П URALIC: 
А SAMOYED: Nenets 
B FINNO-UGRIC: 
1 UGRIC: Hungarian 
2 FINNIC: 
a PERMIC: Komi, Udmurt 
b VOLGAIC: Mari, Mordvin 
c NORTH FINNIC: 
i SAAMIC: Saami (7 Lapp) 
ii BALTIC FINNIC: Finnish, Estonian 


8 ALTAIC 328 
I ALTAICPROPER: 
A TURKIC: Turkish, Azerbaijani, Uzbek, Kazakh 
B MONGOLIAN-TUNGUS: 
1 MONGOLIAN: Khalkha, Khorchin 
2 TUNGUS: Evenki, Manchu 
II KOREAN-JAPANESE: Korean, Ainu, Japanese 


9 СНОКСНІ-КАМСНАТКАМ 329 
I NORTHERN: Chukchi 
П SOUTHERN: Kamchadal 


10 ESKIMO-ALEUT 320 
I Aleut 
H ESKIMO: 
A INUIT: Greenlandic 
B YUPIK: Central Yupik 
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11 ELAMO-DRAVIDIAN 330 
I tElamite 
II DRAVIDIAN: 
A NORTHWEST: Brahui 
B DRAVIDIAN PROPER: 
1 NORTHEAST: Kurux 
2 CENTRAL: Telugu 
3 SOUTH: Kannada, Tamil, Malayalam 


12 SINO-TIBETAN 331 
І СІМІТІС: 
А BAI: Bai 
В CHINESE: Mandarin, Wu, Min, Yue (= Cantonese) 
H TIBETO-KAREN: 
А KAREN: Ра-о, Рһо, Sgaw 
В TIBETO-BURMAN: 
1 TIBETIC: Tibetan, Newari 
2 BARIC: Garo, Tangsa 
3 BURMIC: 
a KUKI-NAGA: Haka, Mizo, Sho 
b KACHIN-LUIC: Jinghpaw, Rawang 
c BURMESE-MOSO: 
i MOSO: Moso 
ii BURMESE-LOLO: 
о BURMIC: Burmese 
В LOLO: Lisu, Lahu 


13 AUSTRIC 334 
I MIAO-YAO: Miao, Mien (= Yao) 
II AUSTROASIATIC: 
A MUNDA: Mundari, Santali 
B MON-KHMER: 
1 NORTH: Khasi, Vietnamese 
2 EAST: Khmer (= Cambodian) 
3 SOUTH: Mon, Nicobarese 
Ш AUSTRO-TAI: 
A DAIC: 
1 LATI-GELAO: Lati 
2 LI-KAM-TAI: 
a LI-LAQUA: Li 
b ВЕ-КАМ-ТАГ: 
i LAKKIA-KAM-TAI: 
a Lakkia: 
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В KAM-TAI: 
I KAM-SUI: Kam, Sui 
II TAI: Thai (= Siamese), Lao (= Laotian) 
B AUSTRONESIAN: 
1 ATAYALIC: Atayal 
2 TSOUIC: Tsou 
3 PAIWANIC: Paiwan 
4 MALAYO-POLYNESIAN: 
a WESTERN MALAYO-POLYNESIAN: 
i NORTHERN PHILIPPINES: Ilokano, Pangasinan, 
Kapampangan 
ii SOUTHERN PHILIPPINES: Maranao, Magindanao 
iii MESO-PHILIPPINE: Tagalog, Cebuano, Hiligaynon 
iv SOUTH MINDANAO: Bagobo, Tboli 
v CELEBES: Bugis, Makassarese 
vi BORNEO: Malagasy, Melanau 
vii SAMA-BAJAW: Yakan, Sama 
viii SUNDIC: Malay, Indonesian, Bali, Sundanese, Achinese, 
Javanese 
b CENTRAL-EASTERN MALAYO-POLYNESIAN: 
i CENTRALMALAYO-POLYNESIAN: 
о CENTRAL MALUKU: Taliabo, Geser 
В SOUTHEAST MALUKU: Kei, Yamdena 
y TIMOR-FLORES: Sikka, Tetun, Letri Lgona 
3 WAIMA' A: Waima'a, Kairui 
е BIMA-SUMBA: Bima, Savu, Manggarai, Ngada 
ii EASTERN MALAYO-POLYNESIAN: 
a SOUTH HALMAHERA -NORTHWEST NEW GUINEA: 
I BOMBERAI: Kaiwai, Onin, Irahutu 
II SOUTH HALMAHERA-GEELVINK BAY: 
A SOUTH HALMAHERA: Gane, Misool, Waigeo 
B GEELVINK BAY: Waropen, Biak, Wandamen 
В OCEANIC: 
I SARMI-YOTAFA: Sobei, Yotafa 
II SIASSI: 
A SEPIK-MADANG: Sissano, Gedaged 
B RAI COAST-NORTHWEST NEW BRITAIN: 
Bariai 
III MARKHAM: Adzera, Mumeng 
IV MILNE BAY- CENTRAL PROVINCE: Bwaidoka, 
Motu 
V КІМЕЕ: Bileki, Vitu 
VI NEW BRITAIN: Arawe, Mengen 
УП NEW IRELAND-TOLAI: Tolai, Tigak 
VIII ADMIRALTY ISLANDS: Seimat, Gele 
IX BOUGAINVILLE: Nehan, Halia 
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K CHOISEUL: Vaghua, Babatana 
KI NEW GEORGIA: Simbo, Roviana 
XII SANTA ISABEL: Kia, Gao 
ХШ SANTA CRUZ: Nembao, Vano 
XIV SOUTHERN NEW HEBRIDES: Aneityum, Lenakel 
XV NEW CALEDONIA: Kumak, Numee 
XVI LOYALTY ISLANDS: Dehu, Nengone 
XVII REMOTE OCEANIC: 
A MICRONESIAN: Marshallese, Trukese 
B SOUTHEAST SOLOMONS: Gela, Arosi 
C CENTRAL & NORTHERN NEW HEBRIDES: 
Sakao, Motlav 
D CENTRAL PACIFIC: 
1 ROTUMAN-HIJIAN: Rotuman, Fijian 
2 POLYNESIAN: Samoan, Tahitian, 
Maori, Hawaiian 


14 INDO-PACIFIC 354 
I TTASMANIAN: TTasmanian 
II ANDAMAN ISLANDS: A-Pucikwar, Onge, Jarawa 
Ш TRANS-NEW GUINEA: 
A MAIN SECTION: 
1 CENTRAL & WESTERN: 
a FINISTERRE-HUON: Kate, Wantoat 
b EAST NEW GUINEA HIGHLANDS: Gadsup, Medlpa 
c CENTRAL & SOUTH NEW GUINEA-KUTUDBUAN: Foe, 
Asmat, Telefol 
d ANGAN: Angaataha, Baruya 
e GOGODALA-SUKI: Suki, Gogodala 
f MARIND: Boazi, Marind 
g KAYAGAR: Kaygir, Tamagario 
h SENTANI: Sentani, Demta 
i DANI-KWERBA: Dani, Kwerba 
j WISSEL LAKES-KEMANDOGA: Uhunduni, Ekagi 
k MAIRASI- TANAH MERAH: Mairasi, Tanah Merah, 
1 WEST BOMBERAI: Karas, Baham 
2 EASTERN: 
a BINANDEREAN: Binandere, Guhu-Semane 
b GOILALAN: Biangai, Kunimaipa 
с KOIARIAN: Koiari, Barai 
d KWALEAN: Kwale 
e MANUBARAN: Maria 
f YAREBAN: Yareba 
g MAILUAN: Magi 
h DAGAN: Daga 
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B MADANG-ADELBERT RANGE: Siroi, Waskia 
C TEBERAN-PAWAIAN: Dadibi, Pawaian 
D TURAMA-KIKORIAN: Kairi, Omati 
E INLAND GULF: Minanibai 
F ELEMAN: Toaripi, Orokolo 
G TRANS-FLY -BULAKA RIVER: Yelmek, Meriam 
H MEK: Nalca 
I SENAGI: Anggor 
J PAUWASI:Dubu 
K NORTHERN: Manem, Awyi 
L NIMBORAN: Nimboran 
M KAURE: Kaure 
N SOUTH BIRD'S HEAD: Konda, Barau 
O KOLOPOM: Kimaghana 
P TIMOR-ALOR-PANTAR: Oirata, Woisika 
IV WEST PAPUAN: Amberbaken, Ternate 
V EAST BIRD'S HEAD: Meax 
VI GEELVINKBAY: Yava 
VII SKO: Sko, Warapu 
ҮШ KWOMTARI-BAIBAI: Kwomtari, Baibai 
IX ARAI: Rocky Peak 
X AMTO-MUSIAN: Amto 
XI TORRICELLI: Au, Yahang, Arapesh, Monumbo 
XII SEPIK-RAMU: Iatmul, Angoram, Kambot 
XIII EAST PAPUAN: Savosavo, Nasici, Santa Cruz 


15 AUSTRALIAN 362 
I YIWAIDJAN: Iwaidja 
II MANGERRIAN: Mangerr 
III GUNWINYGUAN: Gunwinygu, Wagiman 
IV BURARRAN: Burarra 
V MARAN: Alawa 
VI WEST BARKLY: Djingili, Wambaya 
VII GARAWAN: Garawa 
VIII LARAGIYAN: Laragiya 
IX DALY: Maranungku, Malakmalak 
X DJAMINDJUNGAN: Djamindjung 
XI DJERAGAN: Kitja 
XII BUNABAN: Bunaba 
XIII NYULNYULAN: Nyulnyul 
XIV WORORAN: Ungarinjin 
XV PAMA-NYUNGAN: Wik-Mungkan, Dyirbal, Western Desert Language 
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l6 NA-DENE 365 
І Haida 
П CONTINENTAL NA-DENE: 
A Tlingit 
B ATHABASKAN-EYAK: 
1 Eyak 
2 ATHABASKAN: Tanaina, Chipewyan, Apache, Navajo 


17 AMERIND 366 
І МОКТНЕКМ AMERIND: 
А ALMOSAN-KERESIOUAN: 
1 ALMOSAN: 
а Kutenai 
b ALGIC: 
i RITWAN: TWiyot, Yurok 
ii ALGONQUIAN: Blackfoot, Cheyenne, Fox, Micmac 
c MOSAN: 
i CHIMAKUAN: Quileute 
ii WAKASHAN: Kwakwala (= Kwakiutl), Nootka 
iii SALISH: Bella Coola, Squamish, Kalispel 
2 KERESIOUAN: 
a KERESAN: Keres 
b SIOUAN-YUCHI: Yuchi, Catawba, Dakota, Winnebago 
c CADDOAN: Caddo, Wichita, Pawnee 
d IROQUOIAN: Mohawk, Seneca, Cherokee 
B PENUTIAN: 
1 CANADIAN: Tsimshian 
2 WASHINGTON: Chinook 
3 OREGON: Santiam, Coos 
4 PLATEAU: Klamath, Nez Perce 
5 CALIFORNIA: 
a WINTUN: Wintun 
b MAIDUAN: Mountain Maidu 
c YOKUTS: Foothill North Yokuts 
d MIWOK-COSTANOAN: tOhlone, Lake Miwok 
6 NEW MEXICO: Zuni 
7 GULF: 
a TTUNICA-CHITIMACHA: tTunica, tChitimacha 
b YUKI-WAPPO: Yuki, Wappo 
с NATCHEZ-MUSKOGEAN: Choctaw, Muskogee (= Creek), 
Alabama 
8 MEXICAN: 
a Huave 
b TOTONACAN: Totonac 
c MIXE-ZOQUE: Mixe, Zoque 
d MAYAN: Yucatec, Chontal, Tzeltal, Mam, Kekchi, Quiche 
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C HOKAN: 
1 NORTHERN: Shasta, t Yahi, Pomo 
2 Washo 
3 TSALINAN-CHUM ASH: tSalinan 
4 SERI-YUMAN: Seri, Yuma, Mohave, Havasupai 
5 tWAICURI-QUINIGUA: tQuinigua 
6 COAHUILTECAN: Tonkawa 
7 TEQUISTLATECAN: *Tequistlatec 
8 SOUTHERN: Jicaque, Tlapanec 
П CENTRAL AMERIND: 
A TANOAN: Kiowa, Taos 
B UTO-AZTECAN: Northern Paiute, Comanche, Luiseno, Hopi, Yaqui, 
Nahuatl (= Aztec) 
C OTO-MANGUEAN: Otomi, Mixtec, Mazatec, Chinantec 
HI CHIBCHAN-PAEZAN: 
A CHIBCHAN: Tarascan, Yanomami, Cuna, Tunebo 
B PAEZAN: Warao, Itonama, Mura, Embera, Paez, Cayapa 
IV ANDEAN: Quechua, Aymara, Zaparo, Mapudungu (= Araucanian), 
Qawasgar (= Alacaluf) 
V EQUATORIAL-TUCANOAN: 
A MACRO-TUCANOAN: Puinave, Nambikuara, Ticuna, Tucano 
B EQUATORIAL: Shuar, Ayoreo, Guarani, Guahibo, Chipaya, Arawak, 
Island Carib, tTaino 
VI GE-PANO-CARIB: 
A MACRO-CARIB: Yagua, Bora, Ocaina, Galibi (= Carib), Hishkaryana 
B GE-PANO: 
1 MACRO-PANOAN: Lengua, Chimane, Toba, Cashibo, Tacana 
2 MACRO-GE: Rikbaktsa, Bororo, Kaingang, Chavante, Cayapo 


18 LANGUAGE ISOLATES: Basque, Burushaski, Ket, Nahali 377 


19 UNCLASSIFIED: Warenbori, Талгар, Pauwi, Chiquitano 377 


20 PIDGINS AND CREOLES: French Creole, Sea Island Creole (- Gulla), Djuka, 
Tok Pisin, Chinook Jargon 377 


21 INVENTED: Esperanto, Interlingua 378 
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8.6 COMPLETE CLASSIFICATION 


This section offers a complete genetic classification of the world's lan- 
guages, arranged in terms of 17 independent phyla. The order in which 
these phyla are listed is for the most part geographic, beginning in Africa, 
running through Europe, Asia, and Oceania, and endingin the New World. 
Languages are given in capital-and-lower-case letters, language groups іп 
all capitals. Languages and language groups preceded by a dagger are ex- 
tinct. The number of extant languages in each genetic group is given in 
brackets following the name of the group. Those languages whose pho- 
nemic system (i.e., consonants and vowels) may be found in the forth- 
coming Volume 2 are preceded by an asterisk. 


KHOISAN [31]: 
I HADZA [1]: Hadza 
II SANDAWE [1]: Sandawe 
III SOUTHERN AFRICA [29]: 
A NORTHERN [4]: “Ora (= !Kung), #Au.//ei, Maligo, Ekoka 
B CENTRAL [19]: 
1 МАМА [3]: “Мата, Dama, !Ora, Kiri 
2 HALN//UM [1]: Hai.n//um 
3 KWADI [1]: Kwadi 
4 TSHU-KHWE [14]: 
a NORTHEAST [2]: *G//abake, Kwee 
b NORTH-CENTRAL [3]: Ganade, Shua, Danisin 
c CENTRAL [1]: Deti 
d NORTHWEST [4]: Buka, Handa, Xà, *G//ana 
e SOUTHWEST [4]: Naron, *G/wi, G//ani, N/hain.tse 
C SOUTHERN [6]: 
1 TA'A [4]: !Ó, ** Hà, N/amani, //Ng.!’e 
2 IWI [2]: *N/huki, //Xekwi, t/Xam 


cot 


NIGER-KORDOFANIAN [1,064]: 
I KORDOFANIAN [32]: 
A KADUGLI [9]: 
1 WESTERN [3]: Tulishi, Keiga, Kanga 
2 CENTRAL [4]: Miri, Kadugli, *Katcha, Tumma 
3 EASTERN [2]: Krongo, Tumtum 
B KORDOFANIAN PROPER [23]: 
1 KATLA [2]: "Katla, Tima 
2 HEIBAN [10]: 
a EASTERN [2]: “Ко, *Warnang 
b WEST-CENTRAL [8]: 
i CENTRAL [5]: 
[1]: “Кеге 
а EBANG-LOGOL [4]: 
[2]: "Utoro, *Logol 
І EBANG-LARU [2]: *Ebang, “ага 
ii SHIRUMBA [1]: *Shirumba 
ін WESTERN [2]: *Tiro, *Moro 
3 TALODI [6]: 
[1]: *Tegem 
a TALODI PROPER [5]: 
[3]: *Nding, *Jomang, *Tocho 
i NGILE-DENGEBU [2]: *Ngile, *Dengebu 
4 RASHAD [5]: *Tegali, Tagoi, Tingal, Tukum, Turum 
H NIGER-CONGO [1,032]: 
A MANDE [29]: 
[1]: Bobo-Fing 
1 NORTHERN-WESTERN [18]: 
a NORTHERN [13]: 
i SONINKE-BOZO [2]: *Soninke, Bozo 
ii SUSU-YALUNKA [2]: *Susu, Yalunka 


сос 


iii LIGBI-NUMU [2]: Ligbi, Мити 
iv VAI-KONO [2]: “Vai, Kono 
v MANDEKAN [5]: 
a WEST [2]: Mandinka, Xasonke 
p EAST [3]: *Maninka, *Bambara, *Dyula 
b SOUTHWESTERN [5]: 
[3]: *Loko, *Loma, *Kpelle 
i MENDE-BANDI [2]: *Mende, Bandi 
2 SOUTHERN-EASTERN [10]: 
a SOUTHERN [6]: 
[2]: Mano, *Dan 
i KWENI-TURA [4]: *Kweni, Mwa, Nwa, “Таға 
b EASTERN [4]: 
[2]: "Bisa, Busa 
i SAMO [2]: "Sane, "San 
В NIGER-CONGO PROPER [1,003]: 
1 WEST ATLANTIC [46]: 
[1]: Bijago 
a NORTHERN [29]: 
i SENEGAL [3]: *Fula (= Fulani), Serer, *Wolof 
ii CANGIN [5]: Falor, Lehar, Ndut, Non, Safan 
ш BAK [10]: 
a BALANT-GANJA [2]: *Balant, *Ganja 
B MANJAKU-PAPEL [3]: *Mankan, *Manjaku, Papel 
y DIOLA [5]: 
[1]: Bayot 
I DIOLA PROPER [4]: 
[2]: Karon, Kwatay 
A DIOLA-GUSILAY [2]: *Diola, Gusilay 
iv EASTERN SENEGAL- GUINEA BISSAU [8]: *Badyara, “Вапуип, “Вавагі, *Bedik, Biafada, Kasanga, 
Kobiana, Konyagi 
v MBULUNGISH-NALU [3]: Baga Mboteni, Mbulungish, Nalu 


tog 


[1 WEST ATLANTIC] 
b SOUTHERN [16]: 
[2]: Sua, Limba 
i MEL [14]: 
[1]: Gola 
œ TEMNE [8]: Baga Binari, Baga Koga, Baga Maduri, Baga Sitemu, Baga Sobane, Banta, Landuma, 
*Temne 
В BULLOM [5]: Bom, Northern Bullom, "Southern Bullom, Kisi, Krim 
2 CENTRAL NIGER-CONGO [957]: 
a NORTH CENTRAL NIGER-CONGO [206]: 
i KRU [18]: 
[3]: Aizi, Kuwaa, "Seme 
а EASTERN [3]: 
I NORTH [1]: *Bete 
H SOUTH [2]: Godie, Dida 
p WESTERN [12]: 
[4]: Bakwe, *Kru, *Grebo, Tajuason 
I KLAU [2]: Klau, Peripheral Klau 
II BASSA [2]: Bassa, Dewoin 
Ш WE [4]: 
[1]: Krahn 
A GUERE [3]: Guere, Wobe, Nyabwa 
ii DOGON [1]: *Dogon 
ш GUR (= VOLTAIC) [73]: 
[1]: *Bariba 
о KULANGO-LORHON [2]: Kulango, Lorhon 
В SENUFO-TUSYA [15]: 
[1]: *Tusya 
I SENUFO [14]: 
[8]: *Senari, Palaka, Nafana, Tyeliri, Tiefo, Kulele, Karaboro, Viemo 
A NATIORO-WARA [2]: Natioro, Wara 
B SUPPIRE-MIANKA [2]: *Suppire, *Mianka 
C TAGBANA-DYIMINI [2]: Tagbana, Dyimini 





y CENTRAL [55]: 
I KURUMFE-OTI-VOLTA [31]: 
[1]: Kurumfe 
А BWAMU-OTI-VOLTA [30]: 
[1]: Bwamu 
1 OTI-VOLTA [29]: 
a BULI-KOMA [2]: Buli, Koma 
b ОТІ-УОГТА PROPER [27]: 
i EASTERN [5]: Bieri, Wama, Tayari, *Tamari, Nyende 
ii WESTERN-GURMA [22]: 
о WESTERN [12]: Notre, *Gurenne, “More, Safalaba, *Dagara, *Kusaal, 
Kamara, Talne, Nabte, Mampelle, *Dagbani, Hanga 
B GURMA-YOM [10]: 
I GURMA [8]: *Kasele, Basari, "Konkomba, Soruba, *Dye, Gangam, 
*Gurma, *Moba 
II YOM-NAUDEM [2]: Yom, Naudem 
II DOGHOSE-GURUNSI [24]: 
A DOGHOSE-GAN [2]: Doghose, Gan 
B GURUNSI-KIRMA [22]: 
1 GURUNSI [18]: 
a WESTERN [4]: *Kasem, Nuni, *Lyele, Pana 
b CENTRAL-EASTERN [14]: 
i CENTRAL [8]: бін, Winye, Pwo, *Isala, Chakali, *Tampulma, Vagala, Mo 
ii EASTERN [6]: "Kabre, *Lamba, Tem, Chala, Delo, Bago 
2 KIRMA-LOBI [4]: 
a KIRMA-TYURAMA [2]: *Kirma, *Tyurama 
b LOBI-DAN [2]: *Lobi, “Dan 
iv ADAMAWA-UBANGIAN [114]: 
a ADAMAWA (66): 
[5]: Kam, *Longuda, Fali, Nimbari, Kim 
I WAJA [7]: 
[5]: Awak, Dadia, Kamu, Tula, Waja 
A CHAM-MONA [2]: Cham, Mona 


соЄ 


got 


[a NORTH CENTRAL NIGER-CONGO] 
H CHAMBA [3]: *Chamba, Donga, Wom 
Ш DAKA [4]: Daka, Dirrim, Gandole, Taram 
IV DURU [9]: *Doyáyo, Dupa, “Duru, Goom, Kolbila, Kutin, Sari, Sewe, Vere 
V MUMUYE-YENDANG [8]: 
А MUMUYE [5]: *Mumuye, Gengle, Kumba, Teme, Waka 
B YENDANG [3]: Yendang, Kugama, Passam 
VI MBUM-MUNDANG (12): 
[4]: *Galke, Mangbai, Pam, Tupuri 
A МВОМ [4]: *Mbum, Dek, Kali, Lakka 
B MUNDANG [2]: Mundang, Gelama 
С DAMA-MONO [2]: Dama, Mono 
VII YUNGUR [4]: 
[2]: Libo, Mboi 
A YUNGUR-ROBA [2]: Yungur, Roba 
VIII BAMBUKA [4]: Bambuka, Jen, Kanawa, Munga 
IX BUA [10]: Bua, Bolgo, Buso, Day, Fanya, *Gula, Kulaal, Koke, Nielim, Tunya 
p UBANGIAN [48]: 
I GBAYA-GBANZILI [28]: 
А GBAYA [4]: 
[2]: *Gbaya, Bangando 
1 MANZA-NGBAKA [2]: Manza, Ngbaka 
B NGBANDI [3]: *Ngbandi, *Sango, *Yakoma 
С GBANZILI-SERE [21]: 
1 SERE [9]: 
a NORTH [2]: *Feroge, Indri 
b SOUTH [7]: 
i NDOGO-SERE [3]: Ndogo, Sere, Tagbu 
ii BAI-VIRI [2]: Bai, *Viri 
ili MANGAYA-TOGOYO [2]: Mangaya, Togoyo 





Lo€ 


2 GBANZILI [12]: 
a EAST [3]: 
[1]: Mundu 
i MAYOGO-BANGBA [2]: Mayogo, Bangba 
b WEST [9]: 
[3]: *"Gbanzili, *Bwaka, *Monzombo 
i BAKA-GUNDI [6]: Baka, Bomasa, Bayanga, Ngombe, Gundi, Ganzi 
II BANDA [12]: 
A LANGBASI-NGUBU [2]: Langbasi, Ngubu 
B MBANZA-MBANJA [2]: Mbanza, Mbanja 
C NGBUNDU [1]: Ngbundu 
D NGUNDU-GOLO [7]: 
1 NGUNDU-KPAGUA [2]: Ngundu, Kpagua 
2 GUBU-YAKPA [2]: Gubu, Yakpa 
3 LINDA [1]: “Linda 
4 GBI-GOLO [2]: Gbi, Golo 
III ZANDE [4]: 
A ZANDE-NZAKARA [2]: *Zande, *Nzakara 
B BARAMBO-PAMBIA [2]: Barambo, Pambia 
IV AMADI [2]: Amadi, Dongo 
V MONDUNGA [2]: Mondunga, “Mba 
b SOUTH CENTRAL NIGER-CONGO [751]: 
i JO-DEFAKA [5]: 
[1]: Defaka 
a IJO [4]: *Kalabari, *Nembe, Biseni, *Kolokuma 
ii WESTERN [47]: 
a NYO [36]: 
I YI [34]: 
А VOLTA-COMOE (19): 


got 


[b SOUTH CENTRAL NIGER-CONGO] 
1 WESTERN [2]: Abure, Eotile 
2 CENTRAL [7]: 
[2]: *Akan, Abron 
a BAULE-AGNI [3]: *Baule, *Anyi, *Chakosi 
b NZEMA-AHANTA [2]: *Nzema, Ahanta 
3 EASTERN [10]: 
[7]: Awutu, Chiripong, "Gonja, "Nkonya, Nawuri, Nchumunu, Nchumburu 
a NORTH [3]: Achode, Anyanga, Krachi 
B CENTRAL TOGO [15]: 
1 МА [8]: Adele, Akpafu, Siwu, “Basila, *Buem, Likpe, Logba, Sele 
2 KA [7]: Ahlo, Animere, Avatime, Nyangbo, Bowili, Kebu, Kposo 
П GA-ADANGME [2]: “Са, *Adangme 
B TOGO [11]: 
I EWE-FON [2]: "Ewe, *Fon 
II LAGOON [9]: 
A POTU [2]: Mbato, *Ebrie 
В INTERIOR [5]: Attie, Abe, Abiji, Krobu, *Adyukru 
C COASTAL [2]: Aladian, Avikam 
iii EASTERN [699]: 
о CENTRAL NIGER (11): 
I NIGER-KADUNA [4]: 
[1]: *Ebira 
А МОРЕ [3]: 
[1]: *Gade 
1 NUPE-GWARI [2]: *Nupe, *Gwari 
H IDOMA [7]: Alago, *Idoma, *Yala, Akpa, *Igede, *Etulo, *Eloyi 
8 YORUBA-NORTHERN AKOKO [3]: 
[1]: Northern Akoko 
I YORUBA (2): *Yoruba, *Igala 
y EDO [22]: 
I DELTA [3]: *Epie, *Engenni, *Degema 


бос 


H SOUTHWESTERN [5]: ОКре,, *Urhobo, Uvbie, “ІБоКо, Eruwa 
III NORTH-CENTRAL [10]: *Bini, *Esan, Emai, “Iyekhee, Ghotuo, Uneme, North Ibie, Ikpeshi, 
Sasaru, Ososo 
IV NORTHWESTERN [4]: Okpamheri, Uhami, Ukue, Okpe, 
Š LOWER NIGER [7]: 
[1]: Ekpeye 
I IGBO [6]: *Izi, *Igbo, “Ка, *Ukwuani, Ogbah, Ikwere 
е JUKUNOID [10]: 
I YUKUBEN-KUTEB [2]: Yukuben, *Kuteb 
II CENTRAL [8]: 
A KPAN-ICEN [2]: "Kpan, Icen 
B JUKUN [3]: Ashuku, Nama, "Jukun 
C WURBO [3]: Bandawa, Chomo, Jiru 
¢ DELTA CROSS [11]: 
I CENTRAL DELTA [8]: Ogbogolo, *Ogbia, Kugbo, Mini, *Abua, *Odual, Ogbronuagum, Obulom 
II OGONI [3]: *Kana, *Gokana, *Eleme 
n LOWER CROSS [8]: 
[1]: Obolo 
І EFIK [3]: *Efik, *Ibibio, Anaang 
П ORON [4]: Okobo, Oron, Ekit, Ibino 
0 UPPER CROSS [22]: 
[1]: Tita 
I EASTERN [9]: *Mbembe, *Legbo, Leyigha, Lenyima, *Loko, Agoi, Lokukoli, Lubilo, Olulumo 
II WESTERN [8]: Koring, Kukele, Uzekwe, *Kohumono, Ubaghara, Agwaagwune, Umon, Ukpet 
III CALABAR RIVER [4]: Kiong, Korop, Lyoniyorg, Doko 
v BENUE-ZAMBESI [605]: 
I CARA [42]: 
A EASTERN [24]: 
[1]: Amo 
1 CHAWAI [2]: Piti, Chawai 
2 KURAMA [13]: Kuzamani, Kurama, Rumaya, Ruruma, Binawa, Kono, Surubu, Kaivi, 
Kiballo, Kitimi, Kinuku, Dungi, Gure 
3 JERA [8]: tKuda, Butu, Gyem, tTaura, *Lemoro, "Janji, Shani, Buji, "Anaguta, *Chokobo 


ore 


[I CARA] 
В WESTERN (18): 
[1]: Reshe 
1 LARU-LOPA [2]: Laru, Гора 
2 КАМВАКІ-ОГКА |5): 
а КАМВАКІ [2]: Western Kambari, Eastern Kambari 
b KAMUKU [2]: Ngwoi, Kamuku, TBassa-Kontagora, tAshaganna 
с BASSA (7): Bassa-Kaduna, Bassa-Kuta, Gurmana, Pongu, Bauchi, Ura, Bassa-Kwomu 
а DUKA [4]: Lela, “Duka, Puku, Lyase 
H NYIMA (563): 
А PLATEAU (44): 
[4]: Ayu, Yashi, Mabo-Barkul, Ігіруге 
1 BIROM-MIGILI (А): 
[2]: Migili, Aten 
а BIROM [2]: *Birom, Fachara 
2 KAJE-KADARA [16]: 
Ш: Katab 
а YESKWA [3]: Yeskwa, Lungu, Koro 
b KAJE [6]: Kamanton, Kagoma, Jaba, Nandu, Izarek, Kaje 
с KADARA [6]: Kadara, Kuturmi, Ikulu, Idong, Doka, Iku 
3 NINZAM-RUKUBA [9]: 
[1]: Rukuba 
a KWANKA [2]: Kwanka, Shall 
b NINZAM [6]: Ninzam, Mada, Gwantu, Nindem, Kaningkon, Kanufi 
4 EGGON [4]: Eggon, Nungu, Ake, Jidda 
5 FYAM [2]: "Fyam, Horom 
6 TAROK [3]: *Tarok, Bashar, Pai 
7 TURKWAM [2]: Turkwam, Arum 
B WEL [519]: 
1 BENDI-BOKYI [9]: 
[1]: Bokyi 
a BENDI [8]: Bekwarra, Bendi, Obanliku, Ukpe, Ubang, Alege, Utugwang, Bumaji 


TIE 


2 BANTOID [510]: 
а NON-BANTU [16]: 
i MAMBILA-VUTE [8]: *Mambila, Kamkam, Tep, Magu, Kila, Ndoro, Gandua, 
*Vute 
іі TIV-BATU [8]: “Тіу, Otank, Emane, Icheve, Evant, Bitare, Abon, Batu 
b BROAD BANTU [494]: 
i ВАМЕ [114]: 
а JARAWAN-EKOID [20]: 
I JARAWAN [10]: 
[2]: Mboa, Nagumi 
А NIGERIAN (8): Bile, Mbula, Mama, Lame, Jaku, Gubi, Kulung, 
*Jarawa 
II EKOID-MBE [10]: 
[1]: “Mbe 
A EKOID [9]: Ndoe, *Ejagham, Efutop, Мае, *Abanyom, Nkem, 
Nkumm, Nnam, Ekajuk 
В МАМЕЕ (9): Esimbi, Amasi, Kenyang, Kitwii, Kinkwa, Menka, Ngwo, 
Bitieku, Assaka 
y GRASSFIELDS [68]: 
I EASTERN [48]: 
А МКАМВЕ [8]: Адеге, Limbum, Kofa, Kwaja, Ncha, Ndaktup, 
Ntem, Yamba 
B NUN-BAMILEKE [40]: 
1 NUN [6]: *Bamun, Mungaka, Bambalang, Baba, Bapi, Bandeng 
2 NGEMBA [8]: *Mankon, Mundum, Bafut, Nkwen, Bambui, 
Pinyin, Awing, Kpati 
3 BAMILEKE [26]: Babaju, Fe'fe, "Dschang, Bafussam, Bagam, 
Balum, Bamaha, Bamendiinda, Bamenkombit, Bamenyam, 
Banjun, Bangangte, Bangwa, Bapi, Bacham, Bachingu, Bati, 
Batie, Bamugun, Fomopea, Fongondeng, Foto, Fotuni, *Ngwe, 
Babuantu, Upper Mundani 


гіс 


li BANE] 
II WESTERN (20): 
А RING [15]: 
1 МКОМ [8]: 
а WEST [4]: *Aghem, Isu, Weh, Bafumen 
b CENTRAL [4]: Babanki, Kom, Bum, Oku 
2 EAST [Л]: 
[1]: *Lamnso 
a NDOP [6]: Bamessing, Bamessi, Babungo, Bamunka, 
Bamali, Fanji 
B MOMO-NJEN [5]: 
1 МОМО [3]: Tadkon, Ngie, Oshie 
2 NJEN [2]: Njem, Lower Mundani 
Š MENCHUM [6]: Modele, Mukuru, Befang, Bangui, Obang, Okomanjan 
е BEBOID [9]: 
I WESTERN [4]: Mekaf, Bu, Kinabe, Koshin 
II EASTERN [5]: Bebe, Dumbo, Nchanti, Noni, Akweto 
& TIKAR [2]: *Tikar, Bandobo 
ii NARROW BANTU [380]: 
a NORTHWEST BANTU [131]: 
I ZONE A [43]: 
A LUNDU-BALONG [5]: Lundu, Lue, Balong, Bonkeng, *Bafo 
B DUALA [7]: Mboko, Kwiri, Su, *Duala, Wori, Pongo, Limba 
C BUBI-BENGA [4]: Bubi, Noho, *Yasa, *Benga 
D BASAA [6]: Lombi, Bankon, *Basaa, *Tunen, Nyo'o, Mande 
E BAFIA [4]: Fa', Mbong, "Kpa, Djanti 
F SANAGA [2]: Ki, Yambasa 
G EWONDO [5]: Eton, *Ewondo, Bebele, “Bulu, *Fang 
H MAKAA-NJEM [7]: *Ngumba, бо, Makaa, Njem, Konabem, 
Mpiemo, Bomwali 
I KAKO [3]: Kwakum, Pol, *Yaka 
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П ZONE B [41]: 
А МҮЕМЕ [1]: “Муепе 
В КЕЕ [7]: Sekyani, Kele, Mbangwe, Wumvu, Kota, Ndasa, Sighu 
С TSOGO [3]: Tsogo, Капде, Pinji 
D SIRA [6]: Sira, Sangu, Punu, Lumbu, Mbwisi, Вагатла 
E NJEBI [3]: Duma, *Njebi, Tsaangi 
F МВЕРЕ [6]: Mbede, Mbama, *Ndumu, Ngwii, Ngul, Kaningi 
G TEKE [9]: Ngu Ngwoni, North Teke, Northeast Teke, West Teke, 
Central Teke, Bali, East Teke, *South Teke, Wuumu 
H YANZI [6]: Tiene, Mfinu, Mpuono, Yanzi, Di, Mbuun 
HI ZONE C [47]: 
A NGANDO [2]: *Ngando, Kota 
B NGUNDI [6]: Ngundi, Pande, *Mbati, Mbomotaba, Bongili, Lobala 
C MBOSHI [7]: Mboko, *Akwa, Ngare, Koyo, Mboshi, Kwala, Kuba 
D BANGI-NTOMBA [7]: “Ngiri, *Bobangi, Sengele, Sakata, *Ntomba, 
*Losengo, Buja 
E NGOMBE [5]: “Ngombe, Bwela, Bati, Bua, Beo 
F KELE [6]: Mbesa, So, Gesogo, Lombo, Lokele, Foma 
G MONGO [6]: *Mongo, Lalia, Ngando, Konda, Mbole, *Ombo 
H TETELA [5]: *Tetela, Kusu, Nkutu, Yela, *Kela 
I BUSHONG [3]: Ndengese, Songomeno, *Bushong 
В CENTRAL BANTU [249]: 
I ZONE D [17]: 
A ENYA [3]: Lengola, Mituku, Enya 
B LEGA-KALANGA [5]: Bali, North Binja, Lega, South Binja, 
*Holoholo 
С BIRA-HUKU [6]: Piri, Bira, Nyali, Bodo, Komo, Kare 
D NYANGA [1]: “Nyanga 
E BEMBE [2]: Bembe, Buyu 


PIC 


[8 CENTRAL BANTU] 


П ZONE E [22]: 
A LOGOLI-KURIA [7]: Logoli, *Gusii, Kuria, Zanaki, Nata, Sonjo, 
Ngurimi 
B KIKUYU-KAMBA [6]: *Kikuyu, Embu, Meru, *Tharaka, *Kamba, 
Daiso 
C CHAGA [5]: Rwo, Chaga, Rusa, Kahe, Gweno 
D NYIKA [4]: Pokomo, Nyika, Digo, *Taita 
Ш ZONE F [13]: 
A TONGWE [3]: Tongwe, Bende, Fipa 
B SUKUMA-NYAMWEZI [5]: *Sukuma, Nyamwezi, Sumbwa, Kimbu, 
Bungu 
С NYILAMBA-LANGI [5]: Nyilamba, *Rimi, Langi, Mbugwe, Mbugu 
IV ZONE G [28]: 
А GOGO [2]: Gogo, Kaguru 
В SHAMBALA [4]: Tubeta, Casu, Shambala, *Bondei 
C ZIGULA-ZARAMO [9]: Zigula, Ngwele, Zaramo, Ngulu, Ruguru, 
Kami, Kutu, Vidunda, Sagala 
D SWAHILI [4]: Tikuu, “Swahili, Pemba, “Komoro 
E POGORO [2]: Pogoro, Ndamba 
F ВЕМА-КІМСА [7]: Sango, Hehe, Bena, Pangwa, Kinga, Wanji, Kisi 
V ZONE H [17]: 
A KONGO [7]: *Bembe, *Yombe, Kunyi, Ndingi, Mboka, *Kongo, 
Dondo 
B MBUNDU [4]: *North Mbundu, Sama, Bolo, Songo 
C YAKA [5]: Yaka, Suku, Hungu, Mbangala, Sinji 
D НОМСАМА [1]: Hungana 
VI ZONE J [30]: 
А NYORO-LUGANDA [5]: "Nyoro, *Luganda, Soga, Gwere, Nyala 
B HAYA-KWAYA |5|: Karagwe, Haya, Dzindza, Kerebe, Kwaya 
С MASABA-LUYIA [6]: “Masaba, *Luyia, *Nyole, Suamia, Муин, 
*Syan 
D KONZO [2]: *Konzo, Nande 
E SHI-HAVU [7]: Hunde, Havu, Shi, Fuliru, Vira, Kabwari, Tembo 
F RWANDA-RUNDI [5]: “Rwanda, “Кап, Shubi, Hangaza, Vinza 


CIE 


ҮП ZONE K [26]: 
A HOLU [2]: Yeci, Holu 
B CHOKWE-LUCHAZI [9]: Chokwe, *Luimbi, Luchazi, *Lwena, 
Mbuunda, Nyengo, Mbwela, Nkangala, Ngangela 
C SALAMPASU-NDEMBO [3]: Salampasu, Ndembo, Ruund 
D KWANGWA [8]: *Luyi, Mbowe, *Kwangali, Masi, Simaa, Sanjo, 
Kwangwa, Gova 
E SUBIA [2]: Totela, *Subia 
F MBALA [1]: Mbala 
G DIRIKU [1]: Diriku 
VIII ZONE L [15]: 
A BWILE [1]: Bwile 
B SONGYE [6]: Kete, Mbagani, Songye, Luna, Budya, *Bangubangu 
C LUBA [5]: *Luba-Kasai, Kanyok, *Luba-Shaba, Hemba, Sanga 
D KAONDE [1]: Kaonde 
E NKOYA [2]: Mbwera, Nkoya 
IX ZONE M [24]: 
A RUNGU [4]: Pimbwe, Rungwa, Rungu, “Mambwe 
B NYIKA-SAFWA [7]: *Ndali, Mwanga, Nyiha, Malila, Safwa, Iwa, 
Tembo 
С NYAKYUSA [1]: *Nyakyusa 
D BEMBA [3]: Taabwa, Bemba, Aushi 
E BISA-LAMBA [5]: *Bisa, Lala, Swaka, Lamba, Seba 
F LENJE-TONGA [4]: Lenje, Soli, *Ila, Tonga 
X ZONE N [12]: 
A MANDA [4]: Manda, Matengo, Mpoto, Tonga 
B TUMBUKA [1]: Tumbuka 
C NYANJA [1]: “Nyanja 
D SENGA-SENA [6]: Senga, Kunda, *Nyungwe, *Sena, Кие, Podzo 
XI ZONE P [14]: 
A MATUMBI [5]: Ndengereko, Ruihi, Matumbi, Ngindo, Mbunga 
B YAO [5]: "Yao, “Mwera, Makonde, Ndonde, Mabiha 
C MAKUA [4]: *Makua, Lomwe, Ngulu, Chuabo 


grt 


(8 CENTRAL BANTU] 
ХП ZONE R [9]: 


A SOUTH MBUNDU [4]: *South Mbundu, Ndombe, Nyaneka, 
*Nkumbi 


B NDONGA [4]: Kwanyama, *Ndonga, Kwambi, Ngandyera 
C HERERO [1]: Herero 
ХШ ZONE S [22]: 
А SHONA [6]: *Shona, *Zezuru, Manyika, "Karanga, Маап, Kalanga 
B VENDA [1]: *Venda 
C SOTHO-TSWANA [4]: *Tswana, *Northern Sotho, *Southern Sotho, 
*Lozi 
D NGUNI [5]: *Xhosa, "Zulu, *Swati, *Ndebele, Ngoni 
E TSWA-RONGA [4]: Tswa, *Tonga, *Tsonga, *Ronga 
F CHOPI [2]: *Chopi, Tonga-Inhambane 
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NILO-SAHARAN (138): 
I SONGHAI [1]: *Songhai 
H SAHARAN [5]: 
A WESTERN [3]: 
1 KANURI [2]: *Kanuri, Kanembu 
2 TUBU [1]: *Tubu 
B EASTERN [2]: Zagawa, Berti 
III MABAN [6]: 
[2]: Mimi,, Mimi, 
A MABANG [4]: 
[2]: *Mabang, Mesalit 
1 RUNGA-KIBET [2]: Runga, Kibet 
IV FUR [2]: *Fur, Biltine 
V EAST SUDANIC [80]: 
A EASTERN [18]: 
1 NUBIAN [5]: 
a WESTERN [1]: *Meidob 
b CENTRAL [3]: *Birgid, *Debri, *Dongolawi 
c NORTHERN [1]: *Mahas 
2 SURMA [$8]: 
[3]: *Murle, Bale, Zilmamu 
a MAJANG-SHABO [2]: Majang, Shabo 
b KWEGU [3]: Kwegu, *Suri, Me'en 
3 NERA [1]: “Мега 
4 EASTERN JEBEL [4]: 
[1]: *Gaam 
a AKA-KELO-MOLO [3]: Aka, Kelo, Molo 
B WESTERN [15]: 
1 NYIMANG [2]: *Nyimang, *Dinik 
2 TEMEIN [3]: *Ronge, Doni, Dese 
3 TAMA [3]: 
[1]: Merarit 
a TAMA-SUNGOR [2]: *Tama, *Sungor 
4 DAJU [7]: 
a EASTERN [2]: *Shatt, Liguri 
b WESTERN [5]: Mongo, Sila, Nyala, Lagawa, Nyalgulgule 
C NILOTIC [44]: 
1 WESTERN [21]: 
a LUO [14]: 
i NORTHERN [7]: 
[5]: *Shilluk, *Anuak, Jur, Thuri, Bor, 
a MABAN-BURUN [2]: Maban, Burun 
ii SOUTHERN [7]: 
[3]: Labwor, Lwo, Adhola 
a LUO-ACHOLI [4]: 
[1]: “Luo 
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I ALUR-ACHOLI [3]: 
[1]: *Alur 
A LANGO-ACHOLI [2]: Lango, *Acholi 
b DINKA-NUER [7]: 
i NUER [2]: *Nuer, Atuot 
ii DINKA [5]: Agar, *Bor,, Padang, “Кек, Kumam 
2 EASTERN [9]: 
a BARI [2]: “Вагі, Kakwa 
b LOTUXO-TESO [7]: 
i LOTUXO-MAA |3): 
a LOTUXO [1]: *Lotuxo 
В ONGAMO-MAA [2]: Ongamo, “Maasai 
ii TESO-TURKANA [4]: Karamojong, Topotha, Turkana, “Teso 
3 SOUTHERN [14]: 
a KALENJIN [12]: 
i NANDI- 
MARKWETA [5]: Nandi, *Kipsikiis, Keiyo, Tuken, Markweta 
ii ELGON [5]: 
[2]: Sapiny, Kony 
a POK-TERIK [3]: Рок, Ng'oma, Terik 
iii OKIEK [1]: Okiek 
iv PAKOT [1]: *Pakot 
b TATO [2]: Omotik, Datooga 
D KULIAK [3]: 
[1]: Ik 
1 NGANGEA-SO [2]: Ngangea, *So 
VI CENTRAL SUDANIC [36]: 
A WEST CENTRAL [24]: 
1 KRESH [2]: Kresh, Aja 
2 BONGO-BAGIRMI [22]: 
а BONGO [4]: Baka, “Вей, *Bongo, Morokodo 
b SINYAR [1]: Sinyar 
c SARA-BAGIRMI [14]: 
i SARA [6]: 
[1]: Vale 
a SARA PROPER [5]: *Gambai, Kaba, Kaba Dunjo, *Mbai, 
“бага 
ii BAGIRMI [8]: Babalia, *Bagirmi, Fongoro, Disa, Gele, Gula, 
*Kenga, Kuka 
d KARA [3]: Еши, “Kara, *Yulu 
B EAST CENTRAL [12]: 
1 MORU-MADI [6]: 
a NORTHERN [1]: *Moru 
b CENTRAL [4]: Avukaya, Kaliko, Logo, *Logbara 
c SOUTHERN [1]: *Madi 
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2 MANGBETU-ASUA [2]: *Mangbetu, Asua 
3 MANGBUTU-EFE [3]: *Mamvu, Mangbutu, Ndo 
4 BALENDRU [1]: *Balendru 
ҮП BERTA [1]: *Berta 
VIII KUNAMA [1]: *Kunama 
IX KOMUZ [6]: 
A GUMUZ [1]: Bega 
B KOMAN [5]: 
1 ANEJ [1]: Anej 
2 KWAMA-OPO [4]: 
a KWAMA [1]: Kwama 
b KOMO-OPO [3]: 
i OPO [1]: Opo 
ii KOMO-TWAMPA [2]: *Komo, *Twampa 
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AFRO-ASIATIC |241): 
I tANCIENT EGYPTIAN [0]: *FAncient Egyptian, *tCoptic 
II BERBER [30]: 
A tGUANCHE [0]: tGuanche 
B tEAST NUMIDIAN [0]: tEast Numidian (= Old Libyan) 
C BERBER PROPER [30]: 
1 EASTERN [4]: 
[1]: Siwa 
a AWJILA-SOKNA [3]: Awjila, Sokna, Ghadames 
2 TUAREG (3): 
a NORTHERN [1]: Tamahaq 
b SOUTHERN [2]: Tamazheq, *Tamasheq 
3 WESTERN [1]: *Zenaga 
4 NORTHERN [22]: 
a ATLAS [2]: *Shilha, *Tamazight 
b KABYLE [1]: *Kabyle 
с ZENATI [19]: 
[7]: Shawiya, Tidikelt, Tuat, *Riff, Ghmara, Tlemcen, Sheliff 
Basin 
i MZAB-WARGLA [5]: Gurara, Mzab, Wargla, Ghardaia, Tugurt 
ii EAST ZENATI [7]: Tmagurt, Sened, Jerba, Tamezret, Taujjut, 
Zwara, Nefusi 
Ш CHADIC (123): 
А MASA [5]: Masa, Zime, Mesme, Marba, *Musey 
B EAST [28]: 
1 GROUP A [13]: 
a SOMRAI [7]: 
i GROUP 1 [3]: Somrai, *Tumak, Маат 
ii GROUP 2 [4]: Sarwa, Gadang, Mod, Miltu 
b NANCERE [4]: 
i GROUP 1 [2]: Nancere, Lele 
ii GROUP 2 [2]: Gabri, Kabalai 
c KERA [2]: Kera, Kwang 
2 GROUP B [15]: 
[1]: Mokulu 
a DANGLA [11]: 
i GROUP 1 [6]: *Dangla, Migama, Mahwa, Jegu, Mogum, Bidiyo 
ii GROUP 2 [5]: Mubi, Masmaje, Kajakse, Kujarge, Toram 
b SOKORO [3]: Sokoro, Barain, Saba 
C BIU-MANDARA [37]: 
1 GROUP A [32]: 
a TERA [4]: 
i GROUP 1 [2]: *Tera, Jara 
ii GROUP 2 [2]: *Ga'anda, Hona 
b BURA-MANDARA [21]: 
i BURA-HIGI [7]: 
о BURA [5]: 
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I GROUP 1 [3]: “Bura, Chibak, Putai 
H GROUP 2 [2]: *Margi, *Kilba 
B HIGI [2]: *Higi, Bana 
ii MANDARA-MATAKAM [14]: 
[1]: Sukur 
a MANDARA [7]: 
[1]: Lamang 
I MANDARA PROPER [6]: 
[2]: Mandara, Paduko 
A GLAVDA [4]: *Glavda, Guduf, Dghwede, Gvoko 
В MATAKAM [6]: Mafa, "Mofu, Gisiga, Mada, Hurza, 
Muktele 
c DABA [3]: *Daba, Gawar, Hina 
d BATA [4]: 
[1]: Gudu 
i BATA PROPER [3]: *Bata, *Gude, Nzangi 
2 GROUP В [5]: 
[2]: *Musgu, Gidar 
а КОТОКО [3]: 
[1]: *Buduma 
i KOTOKO PROPER (2): Kotoko, Logone 
D WEST [53]: 
1 GROUP A [32]: 
a HAUSA [2]: *Hausa, *Gwandara 
b BOLE-ANGAS [24]: 
i BOLE-TANGALE [14]: 
а BOLE [10]: 
[1]: Karekare 
I BOLE PROPER [9]: *Bole, Bele, Ngamo, *Maha, 
Kirfi, Deno, Kubi, Galambu, Gera 
В TANGALE [4]: 
[1]: *Kanakuru 
I TANGALE PROPER [3]: Tangale, Pero, Kupto 
ii ANGAS [10] 
[1]: Gerka 
а ANGAS PROPER [9]: 
I GROUP 1 [4]: *Angas, *Sura, *Kofyar, Chip 
II GROUP 2 [5]: *Goemai, Montol, Tal, Pyapun, 
Koenoem 
c RON [6]: 
[1]: Fyer 
i RON PROPER [5]: Ron, Sha, Kulere, Karfa, Shagawu 
2 GROUP B [21]: 
a BADE-WARJI [12]: 
i BADE [3]: 
[1]: Duwai 
a BADE PROPER [2]: Bade, *Ngizim 
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ii WARJI [9]: Warji, *Pa'a, Siri, Diri, Jimbin, Miya, Mburku, 
Kariya, Tsagu 
b ZAAR [9]: 
i ZAAR PROPER [5]: *Zaar, Barawa, Zeem, Polchi, Geji 
ii GURUNTUM [2]: Guruntum, Ju 
iii BOGHOM [2]: Boghom, Mangas 
IV OMOTIC [34]: 
A SOUTH [3]: “Натаг, Ari, Dime 
B NORTH [31]: 
1 DIZOID [4]: *Dizi, Dorsha, Shako, Nao 
2 MAO [4]: Bambeshi, Hozo, Sezo, Ganza 
3 GONGA-GIMOJAN [23]: 
a GONGA (8): 
i SOUTH [3]: "Kafa, “Мосһа, Bosha 
ii CENTRAL [1]: Anfillo 
Hi NORTH [4]: Naga, Guba, Boro, Amuru 
b GIMOJAN [15]: 
i JANJERO [2]: *Janjero, Fuga 
ii OMETO-GIMIRA [13]: 
a GIMIRA [2]: Bencho, She 
B CHARA [1]: *Chara 
y OMETO [10]: 
I WEST [1]: *Basketo 
II SOUTH [1]: Male 
ІШ CENTRAL [2]: Oyda, *Ometo 
IV EAST [6]: *Zayse, Zergulla, Ganjule, Gidiccho, 
Gatsame, Koyra 
V CUSHITIC [35]: 
A BEJA [1]: *Beja 
B CUSHITIC PROPER [34]: 
1 CENTRAL CUSHITIC [5]: 
a SOUTH [1]: *Awngi 
b NORTH [4]: 
[2]: Xamir, *Kemant (= Falasha) 
i BILIN-XAMTA [2]: *Bilin, Xamta 
2 EASTERN CUSHITIC [22]: 
a HIGHLAND (Б): 
[1]: *Burji 
i SIDAMO [4]: 
[2]: *Sidamo, *Gedeo 
о KAMBATA-HADIYYA [2]: *Kambata, *Hadiyya 
b YAAKUAN-DULLAY [5]: 
i YAAKUAN [1]: *Yaaku 
ii DULLAY [4]: Harso, *Gawwada, Tsamay, Birale 
c LOWLAND [12]: 
i AFAR-SAHO [2]: *Afar, Saho 
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ii OMO-TANA [6]: 
a GALABOID [2]: *Dasenech, tElmolo, Arbore 
В BAYSO [1]: *Bayso 
y SAM [3]: 
[1]: *Rendille 
I SOMALI-AWEER [2]: *Somali, *Aweer 
iii OROMOID [4]: 
[1]: *Oromo 
а KONSOID [3]: Bussa, *Gidole, *Konso 
3 SOUTHERN CUSHITIC [7]: 
a DAHALO [1]: *Dahalo 
b MA'A [1]: *Ma'a 
c RIFT [5]: 
i EAST [2]: Asa, Kw'adza 
ii WEST [3]: "Iraqw, Alagwa, Burunge 
VI SEMITIC [19]: 
[0]: tEblaic 
А tEAST [0]: *tAkkadian 
B WEST [19]: 
1 CENTRAL [6]: 
a ARAMAIC [2]: *tOld Aramaic, tSyriac, *Assyrian, Aramaic 
b ARABO-CANAANITE [4]: 
i CANAANITE [1]: *tClassical Hebrew, *Hebrew, tPhoenician, 
*tUgaritic, tMoabite 
ii ARABIC [3]: *tClassical Arabic, *Eastern Arabic, *Western 
Arabic, *Maltese 
2 SOUTH (13): 
[1]: tEpigraphic South Arabian, *South Arabian 
a ETHIOPIC [12]: 
i NORTH [2]: 
[0]: *rGeez 
a NORTH PROPER [2]: *Tigre, *Tigrinya 
ii SOUTH [10]: 
а TRANSVERSAL [4]: 
I AMHARIC-ARGOBBA [2]: *Amharic, Argobba 
II EAST GURAGE-HARARI [2]: *Harari, East Gurage 
p OUTER [6]: 
I N-GROUP [2]: 
[0]: *TGafat 
A N-NORTH GURAGE [2]: Soddo, Goggot 
II TT-GROUP [4]: 
A TT-NORTH GURAGE [1]: *Muher 
b WEST GURAGE [3]: 
[1]: Masqan 
1 CENTRAL & PERIPHERAL [2]: *Central 
West Gurage, Peripheral West Gurage 
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CAUCASIAN [38]: 
I SOUTH [4]: 
[2]: "Georgian, *Svan 
А ZAN [2]: *Mingrelian, Laz 
II NORTH [34]: 
A NORTHWEST [5]: 
[1]: *Ubyx 
1 ABXAZ-ABAZA [2]: *Abxaz, *Abaza 
2 CIRCASSIAN [2]: *Adygh, *Kabardian 
В NORTHEAST [29]: 
1 МАХ [3]: 
[1]: *Bats 
a CHECHEN-INGUSH [2]: *Chechen, *Ingush 
2 DAGESTAN [26]: 
a AVARO-ANDI-DIDO [14]: 
i АМАК [1]: *Avar 
ii ANDI [8]: "Andi, *Botlix, *Godoberi, *Chamalal, *Bagulal, 
*Tindi, “Karata, *Axvax 
Hi DIDO [5]: 
[1]: *Xvarshi 
а DIDO-HINUX [2]: "Dido, *Hinux 
B BEZHTA-HUNZIB [2]: *Bezhta, *Hunzib 
b LAK-DARGWA [2]: *Lak, *Dargwa 
c LEZGIAN [10]: 
[2]: *Archi, *Xinalug 
i LEZGIAN PROPER [8]: *Lezgi, *Tabasaran, *Agul, *Rutul, 
*Tsaxur, *Kryts, *Budux, "Udi 
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INDO-HITTITE [144]: 
I TANATOLIAN [0]: 
[0]: *tHittite, TPalaic, TLydian 
А tLUWOID [0]: tLuwian, TLycian 
H INDO-EUROPEAN [144]: 
A ARMENIAN [1]: *tClassical Armenian, "Armenian 
B TTOCHARIAN [0]: *tTocharian A, *tTocharian В 
C INDO-IRANIAN [93]: 
1 INDIC [48]: 
[0]: *tSanskrit 
a ROMANY [1]: *Romany 
b SINHALESE-MALDIVIAN [3]: *Maldivian, *Sinhalese, Vedda 
c NORTHERN INDIA [44]: 
i DARDIC [16]: 
а CENTRAL [5]: *Bashkarik, Maiya, Tirahi, Torwali, 
*Wotapuri 
В CHITRAL [2]: Kalasha, Khowar 
y KUNAR [5]: *Dameli, *Gawar-bati, Shumashti, 
Nangalami, *Pashai 
Š SHINA [4]: *Dumaki, *Phalura, Shina, “Kashmiri 
ii WESTERN [4]: 
о SOUTH [2]: *Marathi, *Konkani 
B NORTHWEST [2]: “Sindhi, *Lahnda 
iii CENTRAL [16]: Parya, Baluj, *Punjabi, *Marwari, *Banjari, 
*Malvi, *Gade Lohar, *Gujarati, *Bhili, Khandeshi, “Hindi, 
"Urdu, Dogri, Western Pahari, *Garhwali, *Kumauni 
iv EAST-CENTRAL [5]: 
[2]: *Nepali, *Awadhi 
a BIHARI [3]: *Bhojpuri, *Maithili, Magahi 
v EASTERN [3]: 
[1]: *Oriya 
a BENGALI-ASSAMESE [2]: “Bengali, *Assamese 


2 NURISTANI [5]: *Ashkun, Kalasha-ala, *Kati, Tregami, Wasi-weri 
3 IRANIAN [40]: 


a EASTERN [12]: 
i NORTHEAST [10]: 
a WEST SCYTHIAN [1]: tScythian, *Ossetic 
В tKHWAREZMIAN [0]: *tAvestan, tKhwarezmian 
y SOGDIAN [1]: tSogdian, *Yaghnobi 
8 tBACTRIAN (0): tBactrian 
е EAST SCYTHIAN [8]: 
[1]: tSaka, *Pashto 
І PAMIR [7]: 
А WAKHI [1]: *Wakhi 
В MUNJI-YIDGHA [2]: *Munji, *Yidgha 
C SANGLECHI-ISHKASHMI [2]: *Sanglechi, 
*Ishkashmi 
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D SHUGHNI-YAZGULAMI [2]: *Shughni, 
*Yazgulami 
ii SOUTHEAST [2]: *Parachi, *Ormuri 
b WESTERN [28]: 
i NORTHWEST [21]: 
[0]: tMedian, tParthian 
а CENTRAL IRAN [8]: *Yazdi, Nayini, Natanzi, Soi, 
Khunsari, Gazi, Sivandi, Vafsi 
B SEMNANI [2]: Semnani, Sangisari 
y CASPIAN [2]: “Gilaki, *Mazanderani 
Š TALYSH [2]: *Talysh, Harzani 
€ ZAZA-GORANI [2]: Zaza, Gorani 
С BALUCHI [2]: *Baluchi, Bashkardi 
п KURDISH [3]: *Kirmanji, “Kurdi, Kermanshahi 
ii SOUTHWEST [7]: 
a PERSIAN [2]: *rOld Persian, “Farsi (= Persian), *Tajiki 
B TATI [1]: “Тай 
y FARS [2]: Fars, Lari 
& LURI [2]: Luri, Kumzari 
D ALBANIAN [1]: *Albanian 
E GREEK [2]: tClassical Greek, "Greek, Tsakonian 
F ITALIC [16]: 
1 TOSCO-UMBRIAN [0]: *tOscan, *tUmbrian, tSabellian 
2 LATINO-FALISCAN [16]: 
[0]: *tFaliscan, *tLatin 
a ROMANCE [16]: 
i SARDINIAN [1]: *Sardinian 
ii CONTINENTAL [15]: 
о EASTERN [4]: 
I NORTH [2]: *Rumanian, *Istro-Rumanian 
H SOUTH [2]: *Megleno-Rumanian, *Arumanian 
p WESTERN [11]: 
I ITALO-ROMANCE [1]: 
А tDALMATIAN [0]: tDalmatian 
B ITALIAN [1]: “Italian 
II RHAETO- 
ROMANCE [3]: *Friulian, Ladin, *Romansch 
III GALLO-IBERO-ROMANCE [7]: 
A. GALLO-ROMANCE [3]: 
1 NORTH [2]: "Franco-Provengal, “French 
2 SOUTH [1]: *Provengal 
B IBERO-ROMANCE [4]: 
1 NORTH [4]: 
a EASTERN [1]: *Catalan 
b CENTRAL [1]: *Spanish 
c WESTERN [2]: *Galician, *Portuguese 
2 tSOUTH [0]: tMozarabic 


8.6 Classification: Indo-Hittite 327 


G CELTIC [4]: 
1 tCONTINENTAL [0]: **Gaulish 
2 INSULAR [4]: 
a GOIDELIC [2]: "Irish (= Gaelic), "Scottish Gaelic, ЯМапх 
b BRYTHONIC [2]: *Breton, *Welsh, *tCornish 
H GERMANIC [12]: 
1 TEAST [0]: *tGothic, tVandalic, tBurgundian 
2 NORTH [5]: 
[0]: *tRunic 
a EAST [2]: *Danish, *Swedish 
b WEST [3]: “Norwegian, “Icelandic, *Faroese 
3 WEST [7]: 
a CONTINENTAL [5]: 
i EAST [3]: *German, *Yiddish, Luxembourgeois 
ii WEST [2]: *Dutch, *Afrikaans 
b NORTH SEA [2]: *Frisian, “English 
I BALTO-SLAVIC [15]: 
1 BALTIC [2]: 
а TWEST [0]: *tOld Prussian 
b EAST [2]: *Latvian, *Lithuanian 
2 SLAVIC [13]: 
a EAST [3]: 
i NORTH [2]: *Russian, *Byelorussian 
ii SOUTH [1]: *Ukrainian 
b WEST [6]: 
i NORTH [2]: *Polish, *Kashubian, *tPolabian 
ii CENTRAL [2]: *Upper Sorbian, Lower Sorbian 
iii SOUTH [2]: *Czech, *Slovak 
c SOUTH [4]: 
[0]: *tOld Church Slavonic 
i WEST [2]: *Slovene, *Serbo-Croatian 
ii EAST [2]: *Macedonian, *Bulgarian 
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URALIC-YUKAGHIR [24]: 
I YUKAGHIR [1]: *Yukaghir, tChuvantsy, TOmok 
II URALIC [23]: 
A SAMOYED [4]: 
1 NORTH [3]: *Nenets, *Enets, "Nganasan 
2 SOUTH [1]: *Selkup, *tKamas 
B FINNO-UGRIC [19]: 
1 UGRIC [3]: 
[1]: "Hungarian 
a OB-UGRIC [2]: *Xanty, "Mansi 
2 FINNIC [16]: 
a PERMIC [2]: *Udmurt, *Komi 
b VOLGAIC [2]: “Mari, *Mordvin 
c NORTH FINNIC [12]: 
i SAAMIC (= LAPPIC) [3]: “Northern Saami, "Eastern Saami, 
"Southern Saami 
ii BALTIC FINNIC [9]: *Finnish, *Ingrian, *Karelian, Olonets, 
Ludic, *Vepsian, *Votic, *Estonian, *Livonian 


ALTAIC [63]: 
I ALTAIC PROPER [59]: 
A TURKIC [31]: 
[0]: *tOld Turkic 
1 BOLGAR [1]: *Chuvash 
2 COMMON TURKIC [30]: 
a SOUTHERN [7]: 
[5]: *Khalaj, *Gagauz, *Turkish, *Crimean Turkish, *Turkmen 
i AZERBAIJANI [2]: *Azerbaijani, Qashqay 
b EASTERN [3]: *Uighur, *Uzbek, *Salar 
c WESTERN [7]: 
[1]: *Bashkir 
i KUMYK-KARACHAY [3]: *Karachay, *Karaim, *Kumyk 
ii TATAR [3]: *Tatar, "Baraba, Crimean Tatar 
d CENTRAL [4]: *Nogai, *Karakalpak, *Kazakh, *Kirghiz 
e NORTHERN [9]: 
i YAKUT [2]: *Yakut, Dolgan 
ii TUVA-ALTAI [7]: 
[1]: *Khakas 
а ALTAI [2]: *Altai, “Northern Altai 
p CHULYM-SHOR [2]: *Chulym, *Shor 
y TUVA-KARAGAS [2]: *Tuva, *Karagas 
B MONGOLIAN-TUNGUS [28]: 
1 MONGOLIAN [12]: 
[0]: *tClassical Mongolian 
a WESTERN [1]: *Moghol 
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b EASTERN [11]: 
i DAGUR [1]: Dagur 
ii MONGUOR [4]: *Monguor, Yellow Uighur, *Pao-an, Santa 
iii OIRAT-KHALKHA [6]: 
a OIRAT-KALMYK [2]: Oirat, *Kalmyk 
B KHALKHA-BURIAT [4]: 
I BURIAT [1]: *Buriat 
П MONGOLIAN PROPER [3]: *Khalkha, *Khorchin, 
*Ordos 
2 TUNGUS [16]: 
a NORTHERN [6]: 
[2]: *Even, *Negidal 
i EVENKI [4]: *Evenki, Solon, Manegir, Orochon 
b SOUTHERN [10]: 
i SOUTHWESTERN [1]: tJu-chen, *Manchu 
ii SOUTHEASTERN [9]: 
a NANAJ [7]: Akani, Birar, *Gold, Kile, *Olcha, *Orok, 
Samagir 
В UDIHE [2]: *Oroch, *Udihe 
II KOREAN-JAPANESE [4]: 
[2]: *Korean, *Ainu 
A JAPANESE-RYUKYUAN [2]: Japanese, *Ryukyuan 


CHUKCHI-KAMCHATKAN [5]: 
I NORTHERN [4]: 

A CHUKCHI [1]: *Chukchi 

B KORYAK-ALYUTOR [3]: *Kerek, *Koryak, *Alyutor 
П SOUTHERN [1]: *Kamchadal 


ESKIMO-ALEUT [9]: 
I ALEUT [1]: *Aleut 
H ESKIMO (8): 
A INUIT [3]: Alaskan Inuit, "Canadian Inuit, *Greenlandic 
B YUPIK [5]: 
1 ALASKAN [2]: Alutiiq, “Central Yupik 
2 SIBERIAN [3]: 
[1]: Sirenik 
a CHAPLINO-NAUKAN [2]: *Chaplino, *Naukan 


330 8 CLASSIFICATION ОЕ THE WORLD'S LANGUAGES 


ELAMO-DRAVIDIAN (28): 
I TELAMITE [0]: *tElamite 
H DRAVIDIAN (28): 
А NORTHWEST [1]: *Brahui 
B DRAVIDIAN PROPER [27]: 
1 NORTHEAST [2]: *Kurux, *Malto 
2 CENTRAL [13]: 
a KOLAMI-PARJI [4]: 
i KOLAMI-NAIKI [2]: *Kolami, Naiki 
ii PARJI-GADABA [2]: *Parji, *Gadaba 
b TELUGU-KUI [9]: 
i GONDI-KUI [7]: 
а GONDI-KOYA [2]: *Gondi, *Koya 
p KONDA-KUI [5]: 
[1]: *Konda 
I MANDA-KUI [4]: 
A MANDA-PENGO [2]: Manda, "Pengo 
B KUI-KUVI [2]: *Kui, *Kuvi 
ii TELUGU [2]: *Telugu, Savara 
3 SOUTH [12]: 
а TULU [3]: *Tulu, Bellari, *Koraga 
b TAMIL-KANNADA [9]: 
i KANNADA [2]: “Kannada, Badaga 
ii TAMIL-KODAGU [7]: 
a KODAGU [2]: *Kodagu, *Kurumba 
p TODA-KOTA [2]: *Toda, *Kota 
y TAMIL-IRULA [3]: 
[1]: *Irula 
I TAMIL [2]: “Tamil, “Malayalam 
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І SINITIC (12): 
А BAI [4]: Егупап, Hoking, Bai, Tali 
B CHINESE [8]: 
[0]: *FArchaic Chinese 
1 MIN [2]: "Northern Min, "Southern Min 
2 MANDARIN-YUE [6]: 
a MANDARIN [1]: *Mandarin 
b XIANG [1]: *Xiang 
c WU [1]: *Wu 
d GAN-KEJIA [2]: *Gan, Kejia 
e YUE [1]: *Yue (= Cantonese) 
H TIBETO-KAREN [246]: 
А KAREN [14]: 
1 PHO [4]: 
[1]: *Pa-o 
a PHO-PHLON [3]: *Pho, Leke, *Phlon 
2 SGAW-BWE [10]: 
[1]: Brec 
a KAYAH [2]: Kayah, Yinbaw 
b BWE [4]: Bwe, Geba, Gekho, Padaung 
с SGAW [3]: *Sgaw, Paki, *Mopwa 
В ТІВЕТО-ВОКМАМ [232]: 
1 ТІВЕТІС [75]: 
[2]: Kaman, Hruso 
a NEWARI-PAHRI [2]: *Newari, Pahri 
b DIGARO-MIDU [2]: Digaro, Midu 
c DHIMAL-TOTO [2]: Dhimal, Toto 
d ADI-NISHI [7]: *Lepcha, Adi, Tagen, *Nishi, Apatani, Yano, 
Lho-pa 
e BODIC [10]: 
[4]: Monpa, Gyarung, Kaike, Ghale 
i TAMANG [4]: 
[2]: "Tamang, Manang 
a GURUNG-THAKALI [2]: *Gurung, *Thakali 
ii TIBETAN [2]: *tClassical Tibetan, *tZhang-zhung, "Central 
Tibetan, *Western Tibetan 
f CENTRAL HIMALAYAN [7]: 
[5]: “Марат, Кай, Raute, Kham, *Kusunda 
i VAYU-CHEPANG [2]: Vayu, *Chepang 
g WEST HIMALAYAN [17]: 
i NORTH [4]: Bunan, Thebor, Sumchu, Sungam 
ii NORTHWEST [7]: 
о KANAURI (А): Kanauri, Chitkhuli, Tukpa, Kanashi 
P MANCHATI [3]: Manchati, Chamba, Rangloi 
ін ALMORA [4]: Rangkas, Darmiya, Chaudangsi, Byangsi 
iv EASTERN [2]: Thami, Bhramu 
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h EAST HIMALAYAN [21]: 
i WESTERN [8]: 
[2]: *Thulung, Chaurasya 
о BAHING [3]: Bahing, Umbule, *Sunwar 
p DUMI [3]: Dumi, *Khaling, Rai 
ii EASTERN [13]: 
а KHAMBU [3]: Khambu, Nachereng, Kulung 
В BONTAWA [10]: 
[3]: Rodong, Lambichong, Athpare 
I LOHORONG [3]: Lohorong, Yakha, *Limbu 
II WALING [4]: Waling, Rungchenbung, Kiranti, 
Dungmali 
i DZORGAIC [5]: Ch'iang, Dzorgai, Kortse, Pingfang, Thochu 
2 BARIC [16]: 
a BODO-GARO [7]: 
[1]: Chutiya 
i BODO [4]: *Bodo, Dimasa, *Tripuri, Lalung, TMoran 
ii GARO [2]: *Garo, "Koch 
b CHANG-TANGSA [9]: Tangsa, Moshang, Shangge, *Nocte, 
Konyak, Wancho, Phom, Chang, Mon 
3 BURMIC [141]: 
a KUKI-NAGA [71]: 
[3]: *Meithei, *Mikir, Mru 
i NORTHERN МАСА [8]: 
[2]: Yimchungru, *Lotha 
о SANGTAM [3]: Lophomi, Thukumi, Pochuri 
В АО [3]: "Ao, Mongsen, Tengsa 
ii EASTERN NAGA [8]: 
о RENGMA [3]: Rengma, Ntenyi, Meluri 
В SIMI [3]: Kezhama, Mao, Zumomi 
y ANGAMI [2]: *Angami, Chakrima 
ii SOUTHERN NAGA [3]: Maring, *Tangkhul, Kupome 
iv WESTERN KUKI [5]: Maram, Liangmai, Nruanghmei, 
Khoirao, Zeme 
v NORTHERN KUKI [6]: *Kamhau, Ralte, Siyin, *Thado, Vuite, 
Zo 
vi OLD KUKI [14]: 
[1]: Kyao 
a LAMGANG [2]: Lamgang, Anal 
p KOLHRENG [3]: Kolhreng, Kom, Tarao 
y CENTRAL [4]: Chiru, Aimol, Purum, Langrong 
6 WESTERN [4]: 
[1]: Southern Luhupa 
I NORTHERN [2]: Hrangkhol, Biate 
II SOUTHERN [1]: Hallam 
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vii CENTRAL KUKI [12]: 
[1]: Langet 
a MIZO [5]: *Mizo, Zahao, Hmar, Pankhu, Paang 
В HAKA [2]: Haka, Shonshe 
Y LAKHER [4]: *Mara, Tlongsai, Hawthai, Sabeu 
viii SOUTHERN KUKI [12]: 
а SHO [7]: *Sho, Khyeng, Khyang, Thayetmo, Minbu, 
Chinbon, Lemyo 
B CHINBOK [2]: Chinbok, Ng'men 
y KHAMI [3]: *Khami, Khumi, Ngala 
b KACHIN-LUIC [19]: 
[1]: Taman, TChairel 
i KACHIN [3]: Jinghpaw, Hkauri, Hka-hku, tili 
ii LUIC [5]: TAndro, Sengmai, *Kadu, Sak, Chakpa, Phayeng 
ii RAWANG [10]: Krangku, Lungmi, Nung, *Rawang, Zithung, 
Trung, Metu, Melam, Tamalu, Tukiumu 
c BURMESE-MOSO [51]: 
i MOSO [2]: Moso, Nazi 
ii BURMESE-LOLO [49]: 
о BURMIC [6]: 
I NORTHERN [5]: Phun, Achang, "Maru, Lashi, 
*Tsaiwa 
H SOUTHERN [1]: *Burmese 
B LOLO [43]: 
[3]: Daignet, Duampu, Nameji, *tSihia, tPai-lang, 
tTosu | 
I NORTHERN LOLO [14]: 
[11]: Thongho, Pakishan, Kangsiangying, 
Kiaokio, Nee, Laichau, Tudza, Nuoku, Liang- 
shan, P'ou-la, Sani 
A NASO [3]: Ulu, Ko-p'u, Weining 
П EAST LOLO [2]: Khoany, Mung 
HI CENTRAL LOLO [6]: 
[1]: *Lisu 
А ТОТО PROPER [5]: Nyi, Chókó, Ahi, Lolopho, 
Phupha 
IV SOUTHERN LOLO [13]: 
А PHUNOI [6]: *Phunoi, Руеп, "Bisu, Mpi, 
Khaskhong, Hwethom 
B AKHA [7]: *Akha, Ako, Asong, Phana, 
Menghwa, Hani, *Lahu 
V RESIDUAL LOLO [5]: 
[1]: Ugong 
A HSIFAN [4]: Manyak, Horpa, Menia, Muli 
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AUSTRIC [1,175]: 
I MIAO-YAO [4]: 
A MIAO [1]: *Miao 
B YAO [3]: “Laka, *Punu, *Mien (= Yao) 
II AUSTROASIATIC [155]: 
A MUNDA [17]: 
1 NORTH [8]: 
a KORKU [1]: *Korku 
b KHERWARI [7]: 
i SANTALI [2]: *Santali, Turi 
ii MUNDARI [5]: *Mundari, *Ho, Korwa, Birhor, Asuri 
2 SOUTH [9]: 
a CENTRAL [2]: *Kharia, *Juang 
b KORAPUT [7]: 
i SORA-GORUM [3]: 
a GORUM [1]: *Gorum 
B SORA-JURAY (2): “Sora, Juray 
ii GUTOB-REMO-GTA' [4]: 
a GUTOB-REMO [2]: *Gutob, *Remo 
В GTA’ [2]: "Plains Gta’, Hill Gta’ 
B MON-KHMER [138]: 
1 NORTH [43]: 
a KHASI [2]: *Khasi, *Amwi 
b PALAUNGIC-KHMUIC [34]: 
[1]: Mang 
i PALAUNGIC [27]: 
a EAST [7]: 
[1]: *Danau 
I PALAUNG-RIANG [6]: 
[1]: *Riang 
A PALAUNG [5]: *Shwe, Da’ang, Rumai, Bonglong, Pale 
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В WEST [20]: 
I LAMET-KHAMET [2]: Lamet, Khamet 
II WAIC [12]: 
А BULANG [5]: Phang, *P’uman,, Plang, Tai-Loi, Kem Degne 
В WA-LAWA [7]: “Wa, “Lawa, La, Phalok, Son, En, K'ala 
III ANGKUIC [6]: Mok, Angku, Коп Keu, P’uman,, Pou Ma, Kiorr 
ii KHMUIC [6]: 
о YUMBRI-MRABRI [2]: Yumbri, Mrabri 
В MAL-KHMU’ [4]: 
[1]: *Khmu’ 
I MAL [3]: *Mal, Phray, Phai 
c VIET-MUONG [7]: 
i GROUP I [3]: *Thavung, Pakatan, Arem 
ii GROUP II [1]: Poong 
iii GROUP III [2]: May, *Muong 
iv GROUP IV [1]: *Vietnamese 
2 EAST [72]: 
a KATUIC [28]: 
i WEST [14]: 
a SO-BRU [10]: So, *Bra, Mangkong, Tri, Chali, Khua, Leung, Kaleu, Galler, Van Kiéu 
В KUY-SUEI [4]: *Suei, Na Nhyang, *Kuy, Yeu 
ii EAST [14]: 
[2]: Katang, Tareng 
а PACÓH-PHUONG [2]: *Pacóh, Phuong 
В KATU-THAP [3]: “Katu, Kantu, Thap 
y TA-OY-TONG [4]: Ta-Oy, Tong, Ong, Kha In 
ò NGEQ-NKRIANG [3]: *Ngeq, Nkriang, Kha Koh 
b BAHNARIC [37]: 
i SOUTH [8]: 
а STIENG-CHRAU [2]: *Stieng, *Chrau 
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[II AUSTROASIATIC] 
p SRE-MNONG [6]: 
[2]: *Sre, Maa’ 
I MNONG [4]: 
[2]: “Eastern Mnong, Biat 
A SOUTHERN-CENTRAL [2]: Southern Mnong, *Central Mnong 
ii WEST [13]: 
[6]: “Loven, Lave, Sok, Sapuan, Cheng, Suq 
о NYAHON-PROUAC [2]: Nyahóñ, Prouac 
В OI-THE [2]: Oi, The 
y BRAO-KRAVET [3]: Brao, Kru’ng, Kravet 
iii CENTRAL [4]: *Bahnar, Alak, Tampuan, Lmam 
iv NORTH [12]: 
a EAST [3]: 
[1]: Takua 
I CUA-KAYONG [2]: Cua, Kayong 
B WEST [9]: 
[2]: “Кепрао, Duan 
I JEH-HALANG [2]: *Jeh, *Haláng 
II SEDANG-TODRAH [5]: 
А SEDANG [2]: *Sedang, “Нге 
B TODRAH-MONOM [3]: 
[1]: Monom 
1 TODRAH [2]: *Modra, *Didra 
c KHMER [1]: "Khmer (= Cambodian) 
d PEARIC [6]: 
i EASTERN [1]: Pear 
ii WESTERN [5]: 
[1]: Suoy 
о SAMRE [2]: Samre, *Somray 
B CHONG [2]: Chong, Sa'och 
3 SOUTH [23]: 
а MONIC [2]: "Mon, Nyah Kur 
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b ASLIAN [19]: 
i SEMANG-JAH HUT [15]: 
[1]: Jah Hut 
о SEMANG-SENOIC [14]: 
I SEMANG [9]: 
[1]: "Che" Wong 
A WESTERN [3]: *Kensiu, *Kenta’ Bong, Mos 
B EASTERN [5]: "Jehai, *Mendriq, *Bateg Deq, *Mintil, *Bateg Nong 
II SENOIC [5]: 
[1]: *Semai 
A NORTH [4]: 
[1]: *Temiar 
1 LANOH-SEMNAM [3]: *Lanoh, *Sabum, *Semnam 
ii SOUTHERN [4]: *Mah Meri, "Semaq Beri, *Semelai, *Temoq 
с NICOBAR ISLANDS [2]: Shompen, *Nicobarese 
HI AUSTRO-TAI [1,016]: 
A DAIC [57]: 
1 LATI-GELAO [2]: Lati, Gelao 
2 LI-KAM-TAI [55]: 
a LI-LAQUA [3]: 
[1]: Li 
i LAQUA-LAHA [2]: Laqua, Laha 
b BE-KAM-TAI [52]: 
[1]: "Be 
i LAKKIA-KAM-TAI [51]: 
о LAKKIA [1]: *Lakkia 
В KAM-TAI [50]: 
I KAM-SUI [6]: Kam, *Mak, Sui, Then, Maonan, Mulao 
JI TAI [44]: 
A NORTHERN [18]: Ch'ien-chiang, Zhuang, Giay, Hsi-lin, Kwei-yang, Ling-yun, Lung-an, 
Nhang, *Bouyei, Po-se, Qui-chau, *Saek, Tien-chow, T'ien-pa, Zhongjia, Tushan, *Wu-ming, 
"Yay 
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IA DAIC] 
В CENTRAL [8]: Man Cao-lan, *Nung, Tay, Tho, Tien-pao, Ts'un-lao, Yung-chu'un, *Lung- 
chow 
C SOUTHWESTERN [18]: *Khamti, *Khün, “Гао (-- Laotian), Lü, Maw, Phu Thai, Phuan, 
*Shan, “Thai (= Siamese), Northern Thai, Southern Thai, “White Tai, “Black Tai, Red Tai, Tay 
Nua, Ya, Yuan, Yunnan Shant'ou 
B AUSTRONESIAN [959]: 
1 ATAYALIC [2]: *Atayal, *Sedeq 
2 TSOUIC [4]: 
[1]: *Rukai 
a TSOUIC PROPER [3]: 
i NORTHERN [1]: *Tsou 
ii SOUTHERN [2]: *Kanakanabu, *Saaroa 
3 PAIWANIC [8]: 
[5]: *Bunun, *Paiwan, *Puyuma, *Saisiyat, "Ami 
а SINICIZED [3]: Kavalan, *Pazeh, “Thao, tKetangalan, tBasay, tiTaokas, tPapora, tBabuza, tHoanya, tSiraya 
4 MALAYO-POLYNESIAN [945]: 
a WESTERN MALAYO-POLYNESIAN [374]: 
[3]: “Chamorro, *Palauan, *Yapese 
i NORTHERN PHILIPPINES [53]: 
a NORTHERN LUZON [41]: 
I NORTHERN CORDILLERAN [15]: 
[6]: "Isnag, *Yogad, Adasen, Malaweg, *Agta, Itawis 
A GADDANG [2]: *Gaddang, *Ga-dang 
B IBANAG-ATTA [2]: *Ibanag, *Atta 
C DUMAGAT [5]: 
[1]: "Urniray Dumaget 
1 NORTHERN [4]: Southeast Cagayan Agta, Kasiguranin, *Casiguran Dumagat, Paranan 
II ILOKANO [1]: *Ilokano 
III SOUTH-CENTRAL CORDILLERAN [25]: 
A CENTRAL CORDILLERAN [18]: 
[1]: Isinai 
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1 KALINGA-ITNEG [5]: 
[1]: "Itneg 
a KALINGA [4]: Kalinga-Itneg, *North Kalinga, *Central Kalinga, *South Kalinga 
2 NUCLEAR CORDILLERAN [12]: 
[1]: *Balangaw 
a IFUGAO [3]: Kiangan, *Central Ifugao, *East Ifugao 
b BONTOK-KANKANAY [8]: 
i BONTOK [4]: Sadanga, East Bontok, South Bontok, *Central Bontok 
ii KANKANAY [4]: Kankanay-Itneg, “North Kankanay, Central Kankanay, South 
Kankanay 
B SOUTHERN CORDILLERAN [7]: 
[1]: “Hongot 
1 PANGASINIC [6]: 
[1]: *Pangasinan 
a BENGUET [5]: 
[1]: Fwak 
i IBALOI-KARAW [2]: *Inibaloi, Karaw 
H KALLAHAN [2]: “North Kallahan, South Kallahan 
В BASHIIC-CENTRAL LUZON-NORTHERN MINDORO [12]: 
I BASHIIC [4]: 
[1]: Yami 
А IVATAN [3]: *Itbayaten, *Ivatanen, Babuyan 
II CENTRAL LUZON [5]: 
[2]: Sinauna, *Kapampangan 
A SAMBALIC [3]: *Tina, Botolan, *Bolinao 
Ш NORTHERN MINDORO [3]: “хауа, Alangan, Tadyawan 
ii SOUTHERN PHILIPPINES [23]: 
о SUBANUN [5]: 
[2]: "Siocon Subanon, Kalibugan 
I EASTERN [3]: *Sindangan, Salug, Lapuyan 


ore 


[a WESTERN MALAYO-POLYNESIAN] 
В MANOBO [15]: 
I NORTH [3]: Карауапеп, *Binukid, Kinamigin 
H CENTRAL [8]: 
A SOUTH (3): 
[1]: Оһо 
1 ATA-TIGWA [2]: *Ata, *Tigwa 
В EAST [3]: *Agusan, *Dibabawon, Rajah Kabungsuan 
С WEST [2]: “Шапеп, *Western Bukidnon 
Ш SOUTH (4): 
[2]: Tagabawa, “Затапрапі Manobo 
А СОТАВАТО [2]: *Kalamansig, Tasaday 
y DANAO [3]: 
[1]: *Magindanao 
I MARANAO-IRANON 121: *Maranao, Iranon 
ñi MESO-PHILIPPINE [54]: 
а SOUTH MANGYAN [3]: 
[1]: *Hanunoo 
I BUHID-TAUBUID [2]: *Buhid, Taubuid 
В KALAMIAN [2]: *Kalamianen, Agutaynon 
y PALAWAN [5]: *Batak, *Tagbanwa, Palaweño, *Molbog, Banggi 
Š CENTRAL PHILIPPINES [44]: 
I TAGALOG [3]: *Tagalog, Lubang, Marinduque 
П BIKOL [8]: 
[1]: Pandan, 
A INLAND [5]: 
[1]: Iriga 
1 BUHI-DARAGA |4): 
[1]: Ваһі 
a LIBON-DARAGA [3]: 
[1]: Libon 
i OAS-DARACA [2]: Oas, Daraga 
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В COASTAL [2]: Virac, “Мара 
III MANSAKAN (8): 
(11: Davaweno 
A NORTH [1]: Kamayo 
B EASTERN [4]: 
[2]: Isamal, Caraga 
1 MANDAYAN [2]: Mandayan, *Mansakan 
C WESTERN [2]: "Kalagan, *Tagakaolo 
IV MAMANWA [1]: “Mamanwa 
V BISAYAN [24]: 
A SOUTH [5]: 
1 SURIGAO [3]: Surigaonon, Jaun-Jaun, Kantilan 
2 BUTUAN-TAUSUG [2]: *Tausug, Butuanon 
В CEBUAN [2]: "Cebuano, Boholano 
C CENTRAL [8]: 
1 WARAYAN [3]: 
[1]: Gubat 
a SAMAR-WARAY [2]: North Samar, *Waray 
2 PERIPHERAL [4]: Camotes, Bantayan, Masbate, *Hiligaynon 
3 ROMBLON [1]: Romblonanon 
D BANTON [2]: Banton, Odionganon 
E WEST [7]: 
1 KUYAN [2]: *Kuyunon, Datagnon 
2 KINARAYAN [2]: Kinaray-a, Pandan; 
3 NORTH-CENTRAL [2]: Looknon, Bulalakawnon 
4 AKLAN [1]: Aklanon 
iv SOUTH MINDANAO [5]: 
[2]: Bagobo, *Tiruray 
a BILIC [3]: 
[1]: *Tboli 
I BLAAN [2]: *Cotabato Bilaan, *Sarangani Bilaan 
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[a WESTERN MALAYO-POLYNESIAN] 
v CELEBES [61]: 
a SANGIR-MINAHASAN [10]: 
I SANGIRIC [5]: Talaud, Sangir, *Sangil, Bantik, Ratahan 
П MINIHASAN [5]: *Tonsea, *Tondano, *Tombulu, *Tontemboan, *Tonsawang 
В MONGONDOW-GORONTALO [9]: 
I MONGONDOW [2]: Ponosakan, Mongondow 
II GORONTALO [7]: Bintauna, Kaidipang, Bolango, Atinggola, Suwawa, Gorontalo, Buol 
y CENTRAL SULAWESI [25]: 
[1]: Banggai 
I EASTERN [3]: Saluan, Andio, Balantak 
II WEST-CENTRAL [21]: 
[1]: Balaesan 
A TOMINI [8]: Tolitoli, Tomini, Bolano, Dondo, Dampelasa, Kasimbar, Petapa, Ndau 
B KAILI-PAMONA [8]: Kaili, Pipikoro, Lindu, Sedoa, Мара, Вада, Rampi, Pamona 
C BUNGKU-MORI [4]: Bungku, Mori, Lalaki, Mekongka 
8 SOUTH SULAWESI [9]: Makassarese, “Вирів, Mandar, *Sa'dan, Massenrempulu, Pitu Ulunna Salo, 
Mamuju, Seko, Wotu 
е MUNA-BUTON [8]: Muna, *Wolio, South Buton, Buton, Tukang-Besi, Bonerate, Kalaotoa, Layolo 
vi BORNEO [109]: 
[1]: Ida'an 
о NORTHEAST [33]: 
I MURUT-TIDONG [12]: 
А MURUT [7]: Selungai Murut, *Kolod, *Sumambu, Paluan, *Timugon, Beaufort Murut, 
Keningau Murut 
B TIDONG [5]: Tidong, Sembakung, Baukan, Kalabakan, Serudung 
II DUSUN-BISAYA [15]: 
A DUSUN [11]: Gana, Kuijau, *Kadazan-Dusun, Lotud, Papar, Rungus, *Eastern Kadazan, Klias 
River Kadazan, Kimaragang, Garo, Tebilung 
B BISAYAN [4]: Tatana, Northern Bisaya, Southern Bisaya, Tutong, 
Ш PAITANIC [6]: Dumpas, Lingkabau, Abai Sungai, Tambanua, Upper Kinabatangan, Гоби 
B APO DUAT [5]: Lundayeh, Kelabit, *Sa’ban, Lengilu, Putoh 
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y REJANG-BARAM [17]: 
I LOWER REJANG [3]: *Melanau, Kanowit, Tanjong 
II REJANG-BINTULU [4]: "Bintulu, Lahanan, Kajaman, Bukitan 
III BARAM-TINJAR [5]: Lelak, Narom, Kiput, Tutong,, Berawan 
IV REJANG-SAJAU [5]: Punan Bah, Punan Merap, Sajau Basap, Burusu, Basap 
9 LAND DAYAK [12]: Silakau, Lara’, Jagoi, Singgie, Biatah, *Bukar Sadong, Bekati’, Benyadu', 
Kembayan, Ribun, Djongkang, Sanggau 
е KAYAN-KENYAH [29]: 
[1]: Punan-Nibong 
I KAYAN [17]: 
[1]: Murik Kayan 
A WESTERN [8]: *Rejang Kayan, Baram Kayan, Mendalam Kayan, Kayan Mahakam, Busang, 
Bahau, Kayan River Kayan, Wahau Kayan 
B MODANG [2]: Modang, Segai 
С MULLER-SCHWANER “PUNAN” [6]: Aoheng, Havongan, Kereho-Uheng, Bukat, Punan 
Aput, Punan Merah 
П KENYAH [11]: 
[5]: Bakung Kenyah, Sebob Кепуаһ, Tutoh Kenyah, Punan Tubu, Wahau Kenyah 
A MAIN KENYAH [6]: Western Kenyah, Upper Baram Kenyah, Bahau River Kenyah, Kayan 
River Kenyah, Mahakam Kenyah, Kelinyao Kenyah 
5 BARITO [12]: 
I EAST [4]: *Lawangan, Dusun Deyah, *Ma’anyan, *Malagasy 
П WEST [6]: 
A NORTHWEST [2]: *Dohoi, *Siang 
В SOUTHWEST [4]: *Kapuas, *Katingan, Kahayan, Bakumpai 
Ш MAHAKAM [2]: Tunjung, Ampanang 
vii SAMA-BAJAW [20]: 
[4]: *Abaknon, Batuan, *Yakan, *Sibutuq 
a SULU-BORNEO [16]: 
I INNER SULU SAMA [3]: “North Sama, Central Sama, South Sama 
П WESTERN SULU SAMA [1]: *Pangutaran 
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[a WESTERN MALAYO-POLYNESIAN] 
HI BORNEO COAST BAJAW |12): 
[1]: Jama Mapun 
A NORTH BORNEO [3]: Kota Belud, Kawang, Putatan 
B INDONESIAN [8]: Sulamu, Kajoa, Roti, Jaya Bakti, Poso, Tongian,, Tongian,, Wallace 
viii SUNDIC [46]: 
[4]: “Javanese, “Sundanese, Mbaloh, Gayo 
a LAMPUNG [2]: *Lampung, Komering 
В BALI-SASAK [3]: "Bali, Sasak, Sumbawa 
y SUMATRA [10]: 
[3]: Lom, Mentawai, Enggano 
I NORTHERN [2]: Simeulue, Nias 
П BATAK [5]: 
[1]: Simalungan 
A NORTHERN [2]: Alas, Karo Batak 
B SOUTHERN [2]: *Toba Batak, Mandailing 
Š MALAYIC [27]: 
[1]: *Madurese 
I MALAYIC DAYAK [2]: Iban, Malayic Dayak 
П MALAY-MOKLEN [13]: 
A MOKLEN [2]: Moklen, Moken 
B MALAYAN [11]: 
1 PARA-MALAY [4]: *Minangkabau, *Urak Lawoi', Duano', Rejang 
2 ABORIGINAL MALAY [4]: Temuan, Orang Hulu, Orang Kanaq, Orang Seletar 
3 LOCAL MALAY (3): *Malay, *Indonesian, Orang Laut 
HI ACHINESE-CHAMIC [11]: 
[1]: Achinese 
A CHAMIC [10]: 
1 NORTH [1]: Huihui 
2 SOUTH [9]: 
а PLATEAU (3): Jarai, Rhade, *Haroi 
b COASTAL [6]: 
i CHAM-CHRU [3]: “East Cham, “West Cham, *Chru 
п RAGLAI [3]: South Raglai, "North Raglai, Cacgia Raglai 


Spe 


b CENTRAL-EASTERN MALAYO-POLYNESIAN [571]: 
i CENTRAL MALAYO-POLYNESIAN [89]: 
о CENTRAL MALUKU [37]: 
[1]: Teor 
I WEST [5]: 
[1]: Ambelau 
А BURU-TALIABO [4]: 
[1]: Buru 
1 SULA-TALIABO [3]: 
[1]: Taliabo 
a SULA [2]: Sula, Mangoli 
П EAST [31]: 
A BANDA [3]: 
[1]: Banda 
1 SERAN LAUT [2]: Geser, Watubela 
В SERAM |28): 
1 EAST SERAM [3]: Bobot, Masiwang, Seti 
2 NUNUSAKU [25]: 
[1]: Kayeli 
a CENTRAL SERAM [2]: Nuaulu, Manusela 
b THREE RIVERS [8]: 
[1]: Wemale 
i AMALUMUTE [7]: 
[1]: Atamanu 
a NORTHWEST SERAM [6]: 
[4]: Saleman, Hulung, Loun, Lisabata 
I ULAT INAI [2]: Naka’ela, Alune 
c PIRU BAY [14]: 
i WEST [5]: 
[1]: Asilulu 
о HOAMOAL [4]: 
I EAST [2]: Boano, Wakasihu 
II WEST [2]: Manipa, Luhu, tBatumerah 
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[D CENTRAL-EASTERN MALAYO-POLYNESIAN] 
ii EAST [9]: 
[1]: Teluti 
a SERAM STRAITS [8]: 
[3]: Kaibobo, Hitu, Paulohi 
I ULIASE [5]: 
[1]: Kamarian 
A HATUHAHA [4]: Saparua, Haruku, Nusalaut, Amahai 
B SOUTHEAST MALUKU [9]: 
[1]: Selaru 
I KEI-ARU [8]: 
[3]: Kei, Yamdena, Selwasa 
A ARU [5]: 
[2]: Ujir, Kola 
1 SOUTH [3]: Dobei, Barakai, Tarangan 
y TIMOR-FLORES [24]: 
[2]: Helong, Naueti 
I FLORES-LEMBATA [3]: *Sikka, Lamaholot, Kedang 
H TIMOR [19]: Кон, Timorese, *Tetun, *Kemak, Tukudede, Mambai, Galoli, Idate, Lakalei, Wetar, 
Roma, Kisar, *Letri Lgona, Dai, Dawera, Masela, Damar, Teun, Serua 
ô WAIMA'A [3]: Waima'a, Habu, Kairui 
€ BIMA-SUMBA [16]: Bima, Kodi, Weyewa, Lamboya, Wanukaka, Anakalangu, Mamboru, East 
Sumbanese, *Savu, Ndao, “Manggarai, Riung, *Ngada, Ende, Lio, Palu'e 
ii EASTERN MALAYO-POLYNESIAN [482]: 
a SOUTH HALMAHERA-NORTHWEST NEW GUINEA [56]: 
I BOMBERAI [9]: 
[1]: Kaiwai 
A ONIN [6]: Uruangnirin, Onin, Sekar, Arguni, Bedoanas, Erokwanas 
B KUIWAI [2]: Irahutu, Nabi 
H SOUTH HALMAHERA-GEELVINK BAY [47]: 
A SOUTH HALMAHERA [23]: 
1 SOUTH [2]: Gane, East Makian 
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2 CENTRAL-EAST [21]: 
[14]: Buli, Maba, Patani, Sawai, Weda, Misool, Amber, Wajamli, Bichole, Giman, 
Salawati, Waigeo, Batanta, As 
a RAJA EMPAT [7]: Maya, Maden, Palamul, Matbat, Laganyan, Kawe, Gebe 
B GEELVINK BAY [24]: 
[6]: Waropen, *Mor,, Iresim, Yaur, Yeretuar, Tandia 
1 BIAKIC [4]: Biak, Ron, Dusner, Meoswar 
2 YAPEN [14]: 
a EAST [2]: Wabo, Kurudu 
b CENTRAL-WESTERN [12]: Wandamen, Ansus, Woi, Pom, Aibondeni, Marau, 
| Munggui, Papuma, Busami, Serui-Laut, Wadapi-Laut, Ambai 
В OCEANIC [426]: 
[1]: Maisin 
I 5АКМІ-ҮОТАҒА [6]: 
A SARMI [3]: Sobei, Bongo, Tarpia 
B YOTAFA [3]: Yotafa, Ormu, Kayupulau 
II SIASSI [44]: 
A SEPIK-MADANG [22]: 
1 SEPIK [13]: 
a WESTERN [5]: Sera, *Sissano, Tumleo, Ali, Ulau 
b EASTERN [8]: 
[1]: Kis 
i KAIRIRU [2]: Kairiru, Kaiep 
ii МАМАМ [5]: Wogeo, Bam, Sepa, *Manam, Medebur 
2 MADANG [9]: 
a BEL [6]: 
i NORTHERN [5]: *Takia, *Megiar, *Matukar, *Gedaged, Bilbil 
ii SOUTHERN [1]: “Ham 
b ASTROLABAN [3]: Wab, *Biliau, Mindiri 
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[B OCEANIC] 
B RAI COAST-NORTHWEST NEW BRITAIN (22): 
1 RAI COAST [7]: 
[2]: "Sio, Mangap 
a KORAP [3]: Lukep, Malasanga, Barim 
b ROINJI-NENGAYA [2]: Nengaya, Roinji 
2 BARIAI-NGERO [8]: 
а NGERO [4]: Tami, Mutu, Gitua, Malamalai 
b BARIAI [4]: 
i CAPE GLOUCESTER [2]: Maleu, Kilenge 
ii KOVE-BARIAI [2]: Bariai, *Kove 
3 HUON GULF [7]: 
a YABIM-BUKAWAC [5]: *Yabim, *Bukawac, Lae, Labu, "Kela 
b KAIWA-SIBOMA [2]: Kaiwa, Siboma 
III MARKHAM [23]: 
A ADZERA [14]: 
1 LOWER WATUT [3]: Silisili, Dangal, Maralango 
2 MARKHAM PROPER [7]: *Adzera, Wampur, Unank, Sukurum, Ngariawan, Sirasira, 
Wampar 
3 BUSU [4]: Sirak, Guwot, Musom, Yalu 
B BUANG [6]: *Mumeng, Piu, Kapin, *Mapos, “Manga, Vehes 
C HOTE [3]: Hote, Yamap, Misim 
IV MILNE BAY- CENTRAL PROVINCE [54]: 
А MILNE BAY [41]: 
1 EASTERN [3]: "Misima, Nimowa, *Tagula 
2 WESTERN [38]: 
a MINIAFIA-UBIR [2]: *Miniafia, Ubir 
b MUKAWA [6]: Doga, Anuki, Mukawa, Gapapaiwa, Boianaki, Wataluma 
c KAKABAI [2]: Dawawa, Igora 
d TAUPOTA-WEDAU [5]: Wedau, Taupota, Garuwahi, *Kehelala, Kukuya 
e SUAU [3]: *Suau, Bohutu, Wagawaga 
f DUAU [5]: Tubetube, Duau, Bunama, Kurada, Mwatebu 
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g DOBU (Б): Sewa Bay, *Dobu, Molima, Galeya, Bosilewa 
h GUMASI [1]: Gumasi 
і BWAIDOKA [6]: Yamalele, Fagululu, Kalokalo, “Bwaidoka, Diodio, *Iduna 
j KILIVILA [3]: Budibud, *Muyuw, Kilivila 
B CENTRAL PROVINCE [13]: 
1 WESTERN [6]: Mekeo, *Roro, Kuni, *Nara, *Gabadi, Doura 
2 EASTERN [7]: 
a MOTU-SINAGORO [3]: *Motu, *Sinagoro, *Keapara 
b MAGORI [4]: Ouma, Magori, Yoba, Bina 
V KIMBE [6]: 
А NAKANAI [2]: Melamela, *Bileki 
B WILLAUMEZ [3]: Xarua, Bola, Bulu 
C BALI-VITU [1]: Vitu 
VI NEW BRITAIN [16]: 
[1]: Tumuip 
A ARAWE [3]: Arawe, Moewehafen, Gasmata 
B LAMOGAI [6]: 
[3]: Longa, Idne, Mok 
1 RAUTO [3]: Lamogai, Pulie, Rauto 
C WHITEMAN [3]: Pasismanua, Kapore, Mangseng 
D MENGEN [3]: Uvol, Mamusi, *Mengen 
VII NEW IRELAND-TOLAI [21]: 
A PATPATAR-TOLAI [10]: Barok, Patpatar, "Sursurunga, *Tangga, Lihir, Konomala, Kandas, Siar, 
*Tolai, “Duke of York 
B NORTHERN NEW IRELAND [7]: Lavongai, *Tigak, "Kara, Tiang, Nalik, Notsi, Tabar 
C ST. MATTHIAS [2]: Mussau, Tenis 
D MADAK [2]: Lamusong, *Madak 
VIII ADMIRALTY ISLANDS [23]: 
A WESTERN [2]: 
1 WUVULU [1]: Wuvulu 
2 NINIGO [1]: *Seimat, tKaniet 
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[В ОСЕАМІС) 
В EASTERN [21]: 
1 MANUS [16]: 
a NORTHWEST ISLANDS [7]: Hermit, Sisi, Sori, Ponam, Апага, Leipon, Loniu 
b WEST [4]: Lindrou, Levei, Mondropolon, Likum 
c EAST [5]: *Gele, Nali, Mokerang, Papitalai, Titan 
2 SOUTHEAST ISLANDS [5]: Baluan, Lenkau, Penchal, Nauna, Pak 
IX BOUGAINVILLE [14]: 
A NORTH & EAST [12]: 
1 NORTH-NEHAN [9]: 
a NEHAN [1]: *Nehan 
b NORTH [7]: 
i BUKA [3]: *Halia, Solos, *Petats 
ii SAPOSA-TINPUTS [4]: *Saposa, Hahon, *Tinputs, *Teop 
c PAPAPANA [1]: Papapana 
2 EAST [3]: “Mono, *Uruava, *Torau 
B WEST [2]: Banoni, Piva 
X CHOISEUL [4]: 
A NORTHWEST [3]: Vaghua, Varisi, Ririo 
В CENTRAL-EASTERN [1]: *Babatana 
XI NEW GEORGIA [10]: 
A EAST [2]: Marovo, Vangunu 
B WEST [8]: Ghanongga, Lungga, *Simbo, Nduke, *Roviana, Ughele, Kusaghe, Hoava 
XII SANTA ISABEL [7]: 
A. WEST [2]: *Kia, Laghu 
B CENTRAL [3]: Kokota, Zazao, Blablanga 
C EAST [2]: Maringe, *Gao 
XIII SANTA CRUZ [6]: 
A UTUPUA [3]: *Nembao, Asumboa, Tanimbili 
B VANIKORO [3]: Buma, *Vano, Tanima 
XIV SOUTHERN NEW HEBRIDES [8]: 
[1]: "Aneityum 
А ERROMANGA [2]: "Sie, Ura 
B TANNA [5]: *Kwamera, "Whitesands, North Tanna, *Lenakel, Southwest Tanna 
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XV NEW CALEDONIA [24]: 
A NORTHERN (13): 
1 FAR NORTH [4]: Yalayu, *Kumak, *Caac, *Yuaga 
2 NORTH [7]: 
[3]: Pwapwa, Pwaamei, *Hmwaveke 
a NEMI [4]: Jawe, Nemi, *Fwai, Pije 
3 CENTRAL [2]: *Cemuhi, *Paici 
B SOUTHERN [11]: 
1 SOUTH [9]: 
a GROUP I [5]: Arho, Arha, *Ajie, Orowe, Neku 
b GROUP II [2]: Меге, *Tiri 
c GROUP III [2]: *Xaracuu, *Xaragure 
2 FAR SOUTH [2]: Dubea, *Numee 
XVI LOYALTY ISLANDS [3]: *Iaai, "Dehu, *Nengone 
XVII REMOTE ОСЕАМІС [156]: 
А MICRONESIAN [9]: 
[1]: *Nauruan 
1 MICRONESIAN PROPER (8): 
[3]: *Gilbertese, *Marshallese, *Kusaiean 
a PONAPEIC-TRUKIC [5]: 
i PONAPEIC [2]: *Mokilese, *Ponapean 
ii TRUKIC [3]: *Trukese, *Carolinian, *Ulithian, TMapia 
B SOUTHEAST SOLOMONS [22]: 
1 GELA-GUADALCANAL [7]: 
[1]: "Bughotu 
a GELA [2]: *Gela, *Lengo 
b GUADALCANAL [4]: *Ghari, *Talise, Malango, Вігао 
2 MALAITA-SAN CRISTOBAL [15]: 
a MALAITA [11]: 
[1]: "Longgu 
i NORTHERN [5]: "Lau, *To’abaita, *Kwara'ae, Langalanga, "Kwaio 
ii SOUTHERN [5]: Dori'o, "Are'are, *Marau, *Oroha, *Ѕа'а 
b SAN CRISTOBAL [4]: *Arosi, “Ғарапі, Bauro, Kahua 
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[XVII REMOTE OCEANIC] 
C CENTRAL & NORTHERN NEW HEBRIDES [94]: 
1 EAST SANTO [5]: 
a NORTH [1]: *Sakao 
b SOUTH [4]: Lorediakarkar, "Shark Bay, Butmas, Polonombauk 
2 MALEKULA INTERIOR [12]: 
[1]: *Labo 

a SMALL NAMBAS [4]: Letemboi, Repanbitip, Dixon Reef, Nasarian 

b MALEKULA CENTRAL [7]: Katbol, Lingarak, Vinmavis, Litzlitz, Larevat, Maragus, 
“Бір Nambas 

3 NORTHEAST NEW HEBRIDES- BANKS ISLANDS [77]: 

a EAST NEW HEBRIDES [29]: Hiw, *Toga, *Lehali, Lehalurup, *Motlav, "Mota, 
*Vatrata, *tAlo-Teqel, *Mosina, *Nume, Koro, Wetamut, *Lakona, *Merlav, “Marino, 
Central Maewo, Baetora, *Northeast Aoban, Nduindui, *Raga, Apma, Sowa, Seke, 
Sa, "North Ambrym, *Lonwolwol, Dakaka, Port Vato, "Southeast Ambrym, *Paama 

b WEST SANTO [24]: Valpei, *Nokuku, Vunapu, Piamatsina, *Tolomako, Tasmate, 
Wusi, Akei, Malmariv, Navut, Lametin, Morouas, Roria, Fortsenal, Amblong, 
Wailapa, *Tangoa, Araki, Narango, Tambotalo, Aore, *Malo, Tutuba, Mafea 

с MALEKULA COASTAL [16]: Malua Bay, Vovo, Mpotovoro, Mae, Vao, *Atchin, 
*Uripiv, Unua, “Кегер, *Aulua, Burmbar, Port Sandwich, *Maskelynes, Ахагар, 
Malfaxal, *Southwest Bay 

d EPI [5]: *Lewo, Bierebo, “Baki, Maii, Bieria 

e CENTRAL NEW HEBRIDES [3]: *Namakura, *North Efate, *South Efate 

D CENTRAL PACIFIC [31]: 
1 ROTUMAN-FIJIAN [3]: 
[1]: *Rotuman 
a FIJIAN [2]: 
i WEST [1]: Nadroga 
ii EAST [1]: "Fijian 
2 POLYNESIAN [28]: 
a TONGIC [2]: *Niuean, *Tongan 
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b NUCLEAR POLYNESIAN [26]: 
i SAMOIC OUTLIER [17]: 
[7]: *Samoan, Niuafo'ou, *Tuvalu, “East Uvea, Pukapuka, Rennellese, *Pileni 
о SOUTHERN [5]: *Tikopia, *West Uvea, *Emae, *Mele, *West Futuna 
В NORTHERN [3]: Sikaiana, *Luangiua, Takuu 
y MICRONESIAN [2]: *Kapingamarangi, *Nukuoro 
ii EASTERN POLYNESIAN [9]: 
[1]: *Rapanui (= Easter Island) 
о CENTRAL [8]: 
I TAHITIC [5]: *Tahitian, Tongareva, *Rarotongan, *Pa'umotu, *Maori, 
TMoriori 
H MARQUESIC [3]: *Mangareva, *Marquesan, *Hawaiian 
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INDO-PACIFIC [731]: 
I TTASMANIAN [0]: ** Tasmanian 
II ANDAMAN ISLANDS [4]: 
A GREAT ANDAMANESE [1]: 
1 TNORTHERN [0]: tAka-Cari, tAka-Kora, tAka-Bo, tAka-Jeru 
2 CENTRAL [1]: TAka-Kede, tAka-Kol, tOko-Juwoi, *A-Pucikwar, 
tAkar-Bale, tAka-Bea 
B SOUTH ANDAMANESE [3]: Onge, Jarawa, Sentinel 
Ш TRANS-NEW GUINEA [505]: 
[5]: *Oksapmin, *Morwap, Molof, Usku, Tofamna 
A MAIN SECTION [267]: 
1 CENTRAL & WESTERN [216]: 
[2]: Dem, Mor, 
a FINISTERRE-HUON [67]: 
i HUON [22]: 
[1]: *Kovai 
о EASTERN [8]: *Kate, "Mape, *Dedua, Sene, Momare, 
Migabac, *Kube, *Kosorong 
В WESTERN [13]: "Ono, Sialum, Nomu, Kinalakna, 
Kumokio, *Selepet, "Timbe, “Komba, *Tobo, *Yaknge, 
*Burum, Mesem, *Nabak 
ii FINISTERRE [45]: 
[1]: Abaga 
о ERAP [11]: Мак, Мек, Nakama, Munkip, Numanggang, 
Sauk, Gusan, Finungwa, Nimi, *Urii, Mamaa 
B WANTOAT [4]: Irumu, Yagawak, Bam, "Wantoat 
y GUSAP-MOT [7]: Ufim, Nahu, *Rawa, Nekgini, Neko, 
Ngaing, Gira 
9 WARUP [8]: Dahating, Bulgebi, Guiarak, Morafa, Forak, 
Degenan, Yagomi, Asat 
е YUPNA [9]: Mebu, Nankina, Gabutamon, Domung, 
*Bonkiman, Wandabong, Nokopo, *Kewieng, Isan 
¢ URUWA [5]: Som, Sakam, Yau, Komutu, Weleki 
b EAST NEW GUINEA HIGHLANDS [38]: 
[2]: *Wiru, Kenati 
i KALAM [3]: 
[1]: Gants 
a KALAM-KOBON [2]: *Kalam, Kobon 
ii EASTERN [7]: 
[1]: Owena 
a GADSUP-AUYANA-AWA [3]: *Gadsup, *Auyana, “Ауға 
В TAIRORA [3]: *Tairora, *Binumarien, *Waffa 
iii EAST-CENTRAL [8]: 
[2]: “Сепдае, “Kamano 
a SIANE [2]: *Siane, Yabiyufa 
В GAHUKU [2]: *Gahuku, *Benabena 
y FORE [2]: *Fore, *Gimi 
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iv CENTRAL (9): 
[1]: *Medlpa 
о CHIMBU [3]: *Chimbu, *Chuave, Nomane 
В WAHGI [2]: *Wahgi, *Nii 
y JMI [3]: *Narak, “Maring, Ganja 
v WEST-CENTRAL [9]: 
[1]: *Huli 
а ENGA [5]: "Enga, Katinja, Nete, Lembena, Ipili 
В ANGAL-KEWA [3]: *Angal, *Kewa, “Зай 
c CENTRAL & SOUTH NEW GUINEA-KUTUBUAN [59]: 
i KUTUBUAN [5]: 
о EAST [2]: *Foe, Fiwaga 
B WEST [3]: *Fasu, *Some, *Namumi 
ii CENTRAL & SOUTH NEW GUINEA [54]: 
[2]: Somahai, *Kamula 
а ASMAT-KAMORO [8]: *Iria, *Asienara, *Kamoro, 
*Sempan, “Central Asmat, Casuarina Coast Asmat, Citak 
Asmat, North Asmat 
В AWYU-DUMUT [14]: 
[7]: *Sawuy, Kotogut, Mapi, Ederah, Kia, Upper 
Digul, Upper Kaeme 
I AWYU [4]: *Siagha, *Pisa, *Aghu, Airo 
II DUMUT [3]: *Kaeti, *Wambon, Wanggom 
y OK [13]: 
I LOWLAND [5]: "Southern Kati, “Northern Kati, 
*Yonggom, *Ninggirum, Iwur 
H MOUNTAIN [8]: *Telefol, *Tifal, *Kauwol, *Faiwol, 
Setaman, *Bimin, *Mianmin, *Ngalum 
6 AWIN-PA [2]: *Awin, “Ра 
е EAST STRICKLAND [3]: *Nomad, Agala, Konai 
С BOSAVI [8]: *Beami, Onabasulu, *Kaluli, Kasua, Kware, 
Tomu, *Bainapi, Sonia 
ті DUNA-POGAYA [2]: *Duna, Pogaya 
Ө MOMBUM [2]: "*Mombum, Koneraw 
d ANGAN [12]: 
[1]: *Angaataha 
i ANGAN PROPER [11]: *Simbari, *Baruya, *Safeyoka, 
*Kawatsa, *Kamasa, *Yagwoia, *Ankave, *Ivori, *Lohiki, 
*Menya, *Hamtai 
e GOGODALA-SUKI [3]: 
[1]: *Suki 
i GOGODALA [2]: *Gogodala, *Waruna 
f MARIND [6]: 
i BOAZI [2]: *Boazi, *Zimakani 
ii MARIND PROPER [2]: *Marind, *Bian Marind 
ін YAQAY [2]: *Yaqay, *Warkay 
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g KAYAGAR [3]: *Kaugat, *Kaygir, *Tamagario 
h SENTANI [4]: 
[1]: Demta 
i SENTANI PROPER [3]: *Sentani, Nafri, Tanah Merah, 
i DANI-KWERBA [11]: 
i SOUTHERN [6]: 
[1]: Wano 
a DANI [2]: “Western Dani, “Grand Valley Dani 
В NGALIK-NDUGA [3]: North Ngalik, South Ngalik, 
Nduga 
ii NORTHERN [5]: 
[2]: Saberi, Samarokena 
a KWERBA [3]: *Kwerba, Airoran, Sasawa 
j WISSEL LAKES-KEMANDOGA [4]: 
[1]: *Uhunduni 
i EKAGI-WODANI-MONI [3]: *Ekagi, "Wodani, “Moni 
k MAIRASI- TANAH MERAH [4]: 
[1]: Tanah Merah, 
i MAIRASI [3]: Mairasi, Northeastern Mairasi, Semimi 
1 WEST BOMBERAI [3]: 
[1]: *Karas 
i WEST BOMBERAI PROPER [2]: "Iha, “Ваһат 
2 EASTERN [51]: 
a BINANDEREAN [16]: 
[1]: *Guhu-Semane 
i BINANDEREAN PROPER [15]: *Suena, *Yekora, *Zia, 
*Binandere, Ambasi, “АеКа, *Orokaiva, Hunjara, *Notu, 
Yega, Gaina, Baruga, Mawae, Dogoro, "Korafe 
b GOILALAN [5]: *Biangai, *Weri, “Kunimaipa, Tauade, Fuyuge 
c KOIARIAN [6]: 
i KOIARIC [3]: *Koita, *Koiari, "Mountain Koiari 
ii BARAIC [3]: *Managalasi, *Barai, *Omie 
d KWALEAN [2]: *Humene, *Kwale, tMulaha 
e MANUBARAN [2]: Doromu, *Maria 
f YAREBAN [5]: Bariji, *Yareba, Doriri, Sirio, Abia 
g MAILUAN [6]: “Magi, Domu, Morawa, Binahari, Bauwaki, Laua 
h DAGAN [9]: Onjob, Maiwa, Jimajima, *Daga, Марепа, Turaka, 
Gwedena, Ginuman, Sona 
B MADANG-ADELBERT RANGE [102]: 
1 MADANG [58]: 
a RAI COAST [29]: 
i EVAPIA [5]: Sinsauru, Asas, Sausi, Kesawai, Dumpu 
ii KABENAU [5]: Arawum, Kolom, *Siroi, Lemio, Pulabu 
iii YAGANON [4]: Yabong, Ganglau, Dumun, Saep 
iv PEKA [4]: Usino, Sumau, Urigina, Danaru 
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у NURU [7]: Usu, "Erima, Duduela, Kwato, Rerau, Jilim, 
Yangulam 
vi MINDJIM [4]: Bom, Male, Bongu, Songum 
b MABUSO [29]: 
[1]: Kare 
i KOKON [3]: *Girawa, Munit, Bemal 
ii СОМ [6]: Sihan, Gumalu, Isebe, *Amele, *Bau, Panim 
ін HANSEMAN [19]: Rapting, Wamas, Samosa, Мигирі, Saruga, 
Nake, Mosimo, Garus, Yoidik, Rempi, Bagupi, Silopi, Utu, 
Mawan, Baimak, Matepi, Gal, Garuh, Kamba 
2 ADELBERT RANGE [44]: 
a PIHOM-ISUMRUD-MUGIL [28]: 
[1]: Mugil 
i PIHOM [22]: 
[2]: Amaimon, Wasembo 
о KAUKOMBARAN [4]: Pay, Pila, Saki, Tani 
P KUMILAN [3]: *Ulingan, Bepour, Moere 
y TIBORAN [5]: Kowaki, Mawak, Hinihon, Musar, 
Wanambre 
6 OMOSAN [2]: Koguman, Abasakur 
е NUMAGENAN [6]: Wanuma, Yaben, Yarawata, Bilakura, 
Parawen, Ukuriguma 
ii ISUMRUD [5]: 
[1]: Dimir 
a KOWAN [2]: Korak, *Waskia 
В MABUAN [2]: Malas, Bunabun 
b JOSEPHSTAAL-WANANG [12]: 
i JOSEPHSTAAL [7]: 
[2]: Osum, Wadaginam 
о SIKAN [2]: Sileibi, Katiati 
В POMOIKAN [3]: Pondoma, Ikundun, Moresada 
ii WANANG [5]: 
[1]: Paynamar 
о АТАМ [2]: Atemple, Angaua 
В EMUAN [2]: Emerum, Musak 
c BRAHMAN [4]: Isabi, Biyom, Tauya, Faita 
C TEBERAN-PAWAIAN [3]: 
[1]: *Pawaian 
1 TEBERAN [2]: *Dadibi, *Podopa 
D TURAMA-KIKORIAN [4]: 
[1]: *Kairi 
1 TURAMA-OMATIAN [3]: Omati, Ikobi, Mena 
E INLAND GULF [3]: 
[1]: Ipiko 
1 МІМАМІВАТ [2]: *Minanibai, “Тао, tKarami, tMahigi 
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F ELEMAN [7]: 
[2]: Purari, Tate 
1 EASTERN [2]: *Toaripi, Uaripi 
2 WESTERN [3]: Opao, Keuro, *Orokolo 
G TRANS-FLY-BULAKA RIVER (28): 
1 BULAKA RIVER [2]: *Yelmek, *Maklew 
2 TRANS-FLY [26]: 
a KIWAIAN [7]: “Southern Kiwai, Wabuda, Bamu Kiwai, Morigi, 
Kerewo, *Northeastern Kiwai, *Arigibi 
b TIRIO [4]: Тігіо, Aturu, Lewada, Mutum 
c EASTERN TRANS-FLY [4]: "Bine, Gidra, Gizra, "Meriam 
d PAHOTURI [3]: *Agób, Idi, *Waia 
e MOREHEAD & UPPER MARO RIVERS [8]: 
[2]: Yey, Moraori 
i NAMBU [2]: *Nambu, Dorro 
ii TONDA [4]: Upper Morehead, Lower Morehead, *Tonda, 
Kanum 
H MEK [9]: 
1 WESTERN [6]: Kosarek, Nipsan, *Nalca, Korapun, Goliath, Eipo 
2 EASTERN [3]: Ketengban, Sirkai, Kinome 
I SENAGI [2]: "Anggor, “Dera 
J PAUWASI [4]: 
1 EASTERN [2]: Yafi, Emumu 
2 WESTERN [2]: Dubu, Towei 
K NORTHERN [32]: 
1 BORDER [12]: 
a WARIS [7]: “Waris, "Manem, Senggi, Waina, Daonda, Simog, 
*Amanab 
b TAIKAT [2]: *Awyi, *Taikat 
с BEWANI [3]: Pagi, Kilmeri, Ninggera 
2 TOR-LAKE PLAIN [20]: 
[3]: Turu, Mawes, Uria 
a CENTRAL LAKE PLAIN [7]: Baburiwa, Taogwe, Taori-Kei, Tori 
Aikwakai, Papasena, Weretai, Taori-5o 
b EAST LAKE PLAIN [3]: Taworta, Dabra, Foau 
c TOR [7]: Berik, Bonerif, Mander, Itik, Kwesten, Maremgi, Wares 
L NIMBORAN [3]: Mekwei, Kemtuk, *Nimboran 
M KAURE [5]: 
[2]: Sause, Kapori 
1 KAURE PROPER [3]: Kaure, Narau, Kosare 
N SOUTH BIRD'S HEAD [10]: 
1 SOUTH BIRD'S HEAD PROPER [6]: Barau, Arandai, Tarof, Kasuweri, 
Puragi, Kampong Baru 
2 INANWATAN [2]: Inanwatan, Duriankere 
3 KONDA-YAHADIAN [2]: Konda, Yahadian 
O KOLOPOM [3]: *Kimaghana, *Riantana, *Ndom 
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P TIMOR-ALOR-PANTAR [18]: 
[7]: Oirata, Lovaea, Fataluku, Kairui, Bunak, Kolana, Tanglapui 
1 ALOR-PANTAR [11]: 
[1]: Makasai 
a ALOR [6]: Kui, *Woisika, Abui, Kelon, Kafoa, Kabola 
b PANTAR [4]: *Blagar, Tewa, Nedebang, Lamma 
IV WEST PAPUAN [24]: 
[1]: Amberbaken 
A BIRD'S HEAD [10]: 
1 NORTH-CENTRAL [4]: 
a NORTH [2]: Karon Pantai, Madik 
b CENTRAL [2]: Karon Dori, Brat 
2 WEST [6]: Kuwani, Tehit, Kalabra, Seget, Moi, Moraid 
В BORAI-HATTAM [2]: Borai, Hattam 
C NORTHERN HALMAHERA [11]: 
1 SOUTH [2]: Ternate, Tidore 
2 NORTH [9]: Galela, Tobelo, Loda, Ibu, Sahu, Modole, Tabaru, Pagu, 
*West Makian 
V EAST BIRD'S HEAD [3]: 
[1]: Mantion 
А MEAX [2]: Meax, Meningo 
VI GEELVINK BAY [5]: 
[1]: Yava 
А EAST GEELVINK BAY [4]: Turunggare, Baropasi, Bauzi, Bapu 
VII SKO [8]: 
А VANIMO [4]: "Sko, Sangke, Wutung, *Vanimo 
B KRISA [4]: Krisa, Rawo, Puari, "Warapu 
ҮШ KWOMTARI-BAIBAI [5]: 
[1]: Pyu 
A KWOMTARI [2]: Kwomtari, *Fas 
В BAIBAI [2]: Baibai, Biaka 
ІХ ARAI [6]: “Rocky Peak, Шегі, Bo, “Атпа, Nimo, Owiniga 
X AMTO-MUSIAN [2]: Amto, Musian 
XI TORRICELLI [48]: 
[1]: “Urim 
A. WEST WAPEI [3]: *One, Seta, Seti 
B WAPEI-PALEI [20]: 
[1]: "Urat 
1 WAPEI [12]: Yis, Yau, “Olo, *Elkei, “Ай, “Үй, Alu, *Ningil, Gnau, Galu, 
Yapunda, Valman 
2 PALEI [7]: Nambi, Agi, Aruop, “Кауік, Aiku, Bragat, Aru 
C MAIMAI [6]: 
[3]: Laeko, Beli, Wiaki 
1 MAIMAI PROPER [3]: Siliput, *Yahang, Heyo 
D KOMBIO-ARAPESH [9]: 
1 KOMBIO [6]: Eitiep, Lou, Kombio, Yambes, Aruek, Wom 
2 ARAPESH [3]: "Mountain Arapesh, "Southern Arapesh, Bumbita 
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E MARIENBERG [7]: Bungain, Mandi, Muniwara, Urimo, Kamasau, Elepi, 
Buna 
F MONUMBO [2]: *Monumbo, Lilau 
XII SEPIK-RAMU [98]: 
[1]: Gapun 
A SEPIK [43]: 
[1]: *Biksi 
1 UPPER БЕРІК [5]: 
[1]: *Abau 
a IWAM [2]: "Iwam, Amal 
b WOGAMUSIN [2]: *Wogamusin, *Chenapian 
2 RAM [3]: Karawa, Bouye, Autu 
3 TAMA [5]: Kalou, Pasi, Pahi, *Mehek, "Mayo 
4 YELLOW RIVER [3]: *Namie, Ak, Awun 
5 MIDDLE SEPIK [11]: 
[1]: Yerakai 
a NUKUMA [2]: *Kwoma, *Kwanga 
b NDU [8]: *Ngala, *Manambu, *Kaunga, *Abelam, *Boiken, Sawos, 
"Jatmul, Kwasengen 
6 SEPIK HILL [15]: 
a SANIO [6]: *Sanio, Paka, Gabiano, Piame, Bikaru, “Hewa 
b BAHINEMO [7]: Bitara, Bahinemo, Mari, Bisis, Watakataui, 
Kapriman, Sumariup 
с ALAMBLAK [2]: Kaningara, *Alamblak 
B LEONARD SCHULTZE [6]: 
1 WALIO [4]: Walio, Pai, Yabio, Tuwari 
2 PAPI [2]: Papi, Duranmin 
C NOR-PONDO [6]: 
1 NOR [2]: *Murik, Kopar 
2 PONDO [4]: *Chambri, Yimas, Karawari, *Angoram 
D RAMU [42]: 
1 YUAT-WAIBUK [13]: 
a MONGOL-LANGAM [3]: Langam, Mongol, Yaul 
b YUAT-MARAMBA [61: 
[1]: Maramba 
i YUAT [5]: Changriwa, Mekmek, Miyak, Biwat, Bun 
c WAIBUK [4]: Waibuk, Aramo, Pinai, Wapi 
2 RAMU PROPER [29]: 
a GRASS [5]: 
[1]: *Banaro 
i GRASS PROPER [4]: *Kambot, Aion, Adjora, Gorovu 
b ARAFUNDI [2]: Alfendio, Meakambut 
c ANNABERG [3]: 
[1]: Rao 
i AIAN [2]: Anor, Aiome 
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а RUBONI (8): 
i OTTILIEN [5]: Watam, Gamei, Kaian, Bosman, Awar 
ii MISEGIAN [3]: *Giri, Sepen, Mikarew 
e GOAM [11]: 
i ATAITAN [4]: Andarum, Igom, Tangu, Tanguat 
ii TAMOLAN [7]: Romkun, Breri, Kominimung, Igana, Akrukay, 
Itutang, Midsivindi 
XIII EAST PAPUAN [23]: 
A YELE-SOLOMONS-NEW BRITAIN [12]: 
1 YELE-SOLOMONS [5]: 
[1]: “Yele 
a TKAZUKURU [0]: tKazukuru, tGuliguli, TDororo 
b CENTRAL SOLOMONS [4]: Bilua, Baniata, Lavukaleve, *Savosavo 
2 NEW BRITAIN [7]: 
[5]: Sulka, Kol, Wasi, Anem, Panaras 
a BAINING-TAULIL [2]: *Baining, *Taulil, TButam 
B BOUGAINVILLE [8]: 
1 EAST [4]: 
a NASIOI [2]: *Nasioi, Nagovisi 
b BUIN [2]: "Buin, Siwai 
2 WEST [4]: 
[2]: Konua, Keriaka 
a ROTOKAS [2]: *Rotokas, Eivo 
C REEF ISLANDS- SANTA CRUZ [3]: 
[1]: *Aiwo 
1 SANTA CRUZ [2]: "Santa Cruz, *Nanggu 
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AUSTRALIAN (170): 
[14]: *Enindhilyagwa, Ndjébbana, *Yanyuwa, Gagudju, Kungarakany, 
Mangarayi, TMingin, Nakkara, *Nunggubuyu, *Tiwi, Waray, Limilngan, 
Umbugarla, Gunbudj, Murrinh-Patha 
I YIWAIDJAN [4]: 
[2]: Margu, Amurag 
A YIWAIDJIC [2]: "Iwaidja, *Maung 
II MANGERRIAN [1]: Mangerr 
III GUNWINYGUAN [9]: 
[3]: Jawony, Rembarrnga, *tNgandi, Ngalakan 
А GUNWINYGIC [3]: Warlang, *Gunwinygu, *Ngalkbon 
B YANGMANIC [3]: 
[2]: “Wagiman, Wardaman 
1 NOLGIN [1]: tDagoman, Yangman 
IV BURARRAN [2]: Guragone, *Burarra 
V MARAN [2]: 
[1]: *Alawa 
A MARIC [1]: tWarndarang, Mara 
VI WEST BARKLY [3]: 
[1]: "Djingili 
A WAMBAYAN [2]: "Ngarndji, "Wambaya 
VII GARAWAN [2]: *Garawa, Waanyi 
VHI LARAGIYAN [2]: Laragiya, Wulna 
IX DALY [9]: 
A MOIL [1]: *Ngenkikurrunggur 
B BRINKEN-WAGAYDY [5]: 
1 BRINKEN [3]: *Marithiyel, *Marimanindji, *Maringarr 
2 WAGAYDY [2]: "Wadjiginy, *Maranungku 
C MALAKMALAK [3]: 
1 DALY [1]: *Madngele 
2 MALAKMALAK PROPER [2]: *Dyeraidy, *Malakmalak 
X DJAMINDJUNGAN [3]: Djamindjung, Nungali, Ngaliwuru 
XI DJERAGAN [3]: 
А MIRIWUNGIC [2]: Gadjerawang, Miriwung 
B KITJIC [1]: *Kitja 
XII BUNABAN [2]: Bunaba, Guniyan 
XIII NYULNYULAN [2]: *Nyulnyul, tYawuru, tWarwa, Nyigina 
XIV WORORAN [3]: Wunambal, *Ungarinjin, Worora 
XV PAMA-NYUNGAN [109]: 
[5]: tMuk Thang, tDhuduroa, “Kala Lagaw Ya (= Mabuiag), TMuruwari, 
Warumungu, Flinders Island, Barrow Point, *Bandjalang, 
tPallangahmiddang 
A YUULNGU [7]: *Djinang, *Dhangu, *Dhuwal, Djinba, *Ritharngu, Dhay’yi, 
Yan-nhangu 
B TANGIC [2]: *Lardil, Gayardilt 
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С PAMAN (25): 

1 NORTHERN [3]: TGudang, *Atampaya, TWudhadhi, Tjungundji, 
*TMpalityanh, TYupngayth, *tLinngithigh, *tNgkoth, *tYinwum, 
*tAritinngithigh, *Awngthim, *tMbiywom 

2 MIDDLE [7]: *Wik-Me’anha, Wik-Ngathana, *Wik-Mungkan, Wik- 
Iiyanh, *Kugu-Muminh, Pakanha, Ayabadhu 

3 WESTERN [2]: Thaayorre, *Yir Yoront 

4 NORTH-EASTERN [2]: Kaantyu, *Umpila 

5 LAMALAMIC [4]: Umbindhamu, *Umbuygamu, Mbariman- 
Gudhinma, *Lama-Lama, tGugu Warra 

6 COASTAL [1]: Koko Pera, *tGog Nar 

7 CENTRAL [4]: *Thaypan, Aghu-Tharnggala, tGogo Mini, tDagalag, 
Kawarrangg, *Oykangand 

8 NORMAN [1]: *Kurtjar, TKuthant, tWalangama 

9 FLINDERS [1]: *Gugadj 

10 tSOUTHERN [0]: *tAgwamin, *tMbabaram, *tMbara 
11 TMAYABIC [0]: tMayi-Kutuna, tMayi-Kulan 
D YALANJIC [2]: *Guugu Yimidhirr, “Gugu Yalandyi 
E YIDINYIC [2]: *Dyaabugay, *Yidiny 
F DYIRBALIC [2]: *Dyirbal, "Warrgamay 
G NYAWAYGIC [1]: *Nyawaygi, TWulguru 
H MARIC [5]: 
[0]: Сауға, tYanda, tYirandhali 

1 KAPU [2]: Gunggari, Birria 

2 MARI [3]: Gugu Badhun, tGudjala, tYilba, tBiri, tGiya, tYiningay, 
tWadjalang, tGayiri, tGangulu, tBidyara, TYiman, Margany, Guwamu 

I WAKA-KABIC [1]: 

1 TKINGKEL [0]: tDarambal, tBayali 

2 TTHAN [0]: tGureng Gureng, tGabi 

3 MIYAN [1]: tWuliwuli, “Waga, tBarunggam, tMuringam 

J tDURUBALIC [0]: tTurrubal, tGowar 
K GUMBAYNGGIRIC [1]: *Gumbaynggir, *tYaygir 
L YUIN-KURIC (1): 

1 KURI [1]: tYugambal, tNganyaywana, *Dyangadi, *tGadang, 
tAwabakal 

2 ТҮСІМ [0]: tGudungura, *tNgarigu, *tThawa, tDyirringany, tDhurga, 
*tDharawal 

3 НҮОКА [0]: tDarkinyung, tDharuk 

M WIRADHURIC [2]: *tGamilaraay, Ngiyambaa, Wiradhuri 
М ВААСАМОЛ [1]: Baagandji 

O tYOTAYOTIC [0]: tYotayota, tYabula-Yabula 

P +KULINIC [0]: 

1 TKULIN [0]: *tWemba Wemba, tNari Nari, TWathawurung, tKolakngat, 
*+Wuywurrung 

2 tBUNGANDID] [0]: tBungandidj, tKuurn Kopan Noot 
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Q tNGARINYERIC-YITHAYITHIC [0]: “ЖМрагіпуегі, TNgayawung, TYuyu, 
TKeramin, tYitha-Yitha 
R KARNIC [14]: 
1 PALKU [2]: *Lhanima, *Pitta-Pitta 
2 ARABANA [1]: *Arabana 
3 KARNA [6]: Midhaga, Garuwali, Yarluyandi, Ngamini, Yandruwandha, 
*Diyari 
4 NGURA [3]: “Wangkumara, Badjiri, Bidjara 
5 YARLI [2]: *Malyangaba, Yardliwarra 
S WAGAYA-WARLUWARIC [3]: 
1 WAGAYA [1]: Wagaya 
2 WARLUWARIC [2]: Yindjilandji, Warluwara 
T KALKATUNGIC [1]: *Kalkatungu, *tYalarnnga 
U ARANDIC [3]: 
1 ARTUYA [1]: *Kaititj 
2 URTWA [2]: *Alyawarra, “Атапда, thower Aranda 
V SOUTH-WEST [31]: 
1 NGUMBIN [5]: "Walmajarri, Djaru, Gurindji, Mudburra, Ngarinman 
2 MARNGU [3]: *Garadyari, Mangarla, *Nyangumarta 
3 NGAYARDA [8]: *Ngarla, *Nyamal, *Nyiyabali, tTjurruru, *Kariyarra, 
Martuyhunira, Nhuwala, *Yindjibarndi, Binigura 
4 MANTHARDA [3]: *Wariyangga, Djwarli, *Thargari 
5 KANYARA [2]: *Dhalandji, *Bayungu 
6 TKARDU [0]: *tYinggarda, tMaya, tMalgana, *tNhanda 
7 WADJARI [3]: *Wadjari, Wirdimay, *Karlamay 
8 WATI [2]: tWirangu, *Warnman, "Western Desert Language 
9 NGARGA [3]: Warlmanpa, *Warlpiri, *Ngardi 
10 tNYUNGAR [0]: *tNyungar 
11 NGADJUNMAYA [1]: *Ngadjunmaya 
12 YURA [1]: *Adynyamathanha, tBanggarla, tKaurna 
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NA-DENE [34]: 
I HAIDA [1]: *Haida 
H CONTINENTAL NA-DENE [33]: 
А TLINGIT [1]: *Tlingit 
B ATHABASKAN-EYAK [32]: 
1 EYAK [1]: *Eyak 
2 ATHABASKAN [31]: 
a TANAINA-AHTNA [2]: *Tanaina, Ahtna 
b INGALIK-KOYUKON [3]: 
[1]: Ingalik 
i KOYUKON-HOLIKACHUK [2]: Holikachuk, Koyukon 
c TANANA-UPPER KUSKOKWIM [4]: 
i TANANA [3]: Upper Tanana, Tanacross, Tanana 
ii UPPER KUSKOKWIM [1]: Upper Kuskokwim 
d TUTCHONE [2]: Northern Tutchone, Southern Tutchone 
e TAHLTAN-KASKA [2]: Tahltan, Kaska 
f TTSETSAUT [0]: tTsetsaut 
g PACIFIC COAST [2]: 
i OREGON [1]: 
а TUMPQUA [0]: TUmpqua 
B TOLOWA-GALICE [1]: *Tolowa, *tGalice 
ii CALIFORNIA [1]: 
а НОРА [1]: “Нара 
B TMATTOLE-WAILAKI [0]: tMattole, *tWailaki 
h CANADIAN [12]: 
i HAN-KUTCHIN [2]: Han, Kutchin 
ii HARE-CHIPEWYAN [4]: 
a HARE-SLAVE [3]: *Hare, Dogrib, *Slave 
В CHIPEWYAN [1]: *Chipewyan 
iii BEAVER-SEKANI [2]: Beaver, Sekani 
іу CARRIER-CHILCOTIN [3]: 
a CHILCOTIN [1]: Chilcotin, tNicola 
В BABINE-CARRIER [2]: Babine, *Carrier 
v TKWALHIOQUA [0]: tKwalhioqua 
vi SARCEE [1]: *Sarcee 
i APACHEAN [4]: 
[1]: *Kiowa Apache 
i NAVAJO-APACHE [3]: 
(11: Eastern Apache 
a WESTERN APACHE-NAVAJO [2]: “Western Apache, 
*Navajo 
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AMERIND [583]: 
I NORTHERN AMERIND [164]: 
A ALMOSAN-KERESIOUAN [68]: 
1 ALMOSAN [44]: 
a KUTENAI [1]: *Kutenai 
b ALGIC [16]: 
і RITWAN [1]: *tWiyot, *Yurok 
i ALGONOUIAN [15]: 
[0]: *tBeothuk 
a BLACKFOOT [1]: *Blackfoot 
В ALGONQUIAN PROPER [14]: 
[8]: *Cheyenne, *Arapaho, *Cree, *Ojibwa, 
*Potawatomi, *Menominee, “For, +#Шіпоіѕ, *Shawnee 
I EASTERN [6]: *Micmac, *Passamaquoddy, *Eastern 
Abenaki, Western Abenaki, thoup A, tLoup В, 
*tMassachusett, tNarragansett, tMohegan, tQuiripi, 
tUnquachog, tMahican, *Munsee, Unami, 
tNanticoke, *tPowhatan, tCarolina 
c MOSAN [27]: 
і CHIMAKUAN [1]: *tChemakum, *Quileute 
ii WAKASHAN [6]: 
о NORTHERN [3]: *Heiltsuk, *Haisla, *Kwakwala 
(7 Kwakiutl) 
В SOUTHERN [3]: Makah, *Nitinat, *Nootka 
iii SALISH [20]: 
а BELLA COOLA [1]: *Bella Coola 
В SALISH PROPER [19]: 
I COAST [8]: 
A CENTRAL [8]: 
1 NORTHERN [2]: *Comox, tPentlatch, 
Seshelt 
2 SQUAMISH [1]: *Squamish 
3 TNOOKSACK [0]: tNooksack 
4 HALKOMELEM [1]: *Halkomelem 
5 STRAITS [2]: *Clallam, *Northern Straits 
6 LUSHOOTSEED-TWANA [2]: *Lushootseed, 
*Twana 
B TTILLAMOOK [0]: *fTillamook 
H TSAMOSAN [4]: 
A INLAND [2]: "Upper Chehalis, *Cowlitz 
B MARITIME [2]: Quinault, Lower Chehalis 
HI INTERIOR [7]: 
A NORTHERN [3]: Lillooet, Thompson, *Shuswap 
B SOUTHERN [4]: *Columbian, *Okanagan, 
*Kalispel, "Coeur d'Alene 
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2 KERESIOUAN [24]: 
a KERESAN [2]: “Eastern Keres, "Western Keres 
b SIOUAN-YUCHI [11]: 
i YUCHI [1]: *Yuchi 
ii SIOUAN [10]: 
а CATAWBA [1]: *Catawba, TWoccon 
p SIOUAN PROPER [9]: 
I MANDAN [1]: *Mandan 
H NORTHERN PLAINS [2]: *Crow, *Hidatsa 
III MISSISSIPPI VALLEY [6]: 
A DAKOTA [1]: "Dakota 
В DHEGIHA [3]: *Omaha, *Osage, tQuapaw, 
Kansa 
C CHIWERE-WINNEBAGO [2]: *Chiwere, 
*Winnebago 
IV TSOUTHEAST [0]: 
A ТЕАЅТ [0]: *tTutelo 
В tSOUTH [0]: *tBiloxi, *tOfo 
c CADDOAN [4]: 
[0]: tAdai 
i SOUTHERN [1]: *Caddo 
ii NORTHERN [3]: 
а WICHITA [1]: "Wichita 
В PAWNEE-KITSAI [2]: 
I KITSAI [0]: *+Kitsai 
II PAWNEE [2]: *Pawnee, *Arikara 
d IROQUOIAN [7]: 
i NORTHERN [6]: 
[0]: TLaurentian, Huron 
a FIVE NATIONS [5]: 
[0]: tSusquehannock 
I MOHAWK-ONEIDA [2]: *Mohawk, *Oneida 
П SENECA-ONONDAGA [3]: 
A. ONONDAGA [1]: *Onondaga 
B SENECA-CAYUGA [2]: *Seneca, Cayuga 
B TUSCARORA [1]: *Tuscarora, *tNottoway 
ii SOUTHERN [1]: "Cherokee 
B PENUTIAN [68]: 
1 CANADIAN [1]: *Tsimshian 
2 WASHINGTON [1]: tLower Chinook, “Upper Chinook 
3 OREGON [3]: 
a TAKELMAN [1]: 
i TAKELMA [0]: *tTakelma 
ii KALAPUYA [1]: *Santiam, tTfalati, tYonkalla 
b COAST [2]: 
i COOS [1]: *Coos 
ii YAKONAN [1]: *tAlsea, *Siuslaw 
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4 PLATEAU [3]: 
a KLAMATH [1]: *Klamath 
b SAHAPTIN- NEZ PERCE [2]: *Sahaptin, *Nez Perce 
c TMOLALE-CAYUSE [0]: *tMolale, tCayuse 
5 CALIFORNIA [13]: 
a WINTUN [3]: Nomlaki, *Patwin, *Wintun 
b MAIDUAN [3]: “Mountain Maidu, Northwest Maidu, "Southern 
Maidu 
c YOKUTS [1]: *Foothill North Yokuts, tFoothill South Yokuts, 
*tValley Yokuts 
d MIWOK-COSTANOAN [6]: 
i TCOSTANOAN [0]: 
a tNORTHERN [0]: TOhlone, tSan Jose, tSan Francisco, 
tSanta Cruz 
B tSOUTHERN (0): *tRumsen, *tMutsun, tSoledad, 
tMonterey 
ii MIWOK [6]: 
a WESTERN [2]: *Coast Miwok, *Lake Miwok 
В EASTERN [4]: *Central Sierra Miwok, *Southern Sierra 
Miwok, *Northern Sierra Miwok, *Plains Miwok 
6 NEW MEXICO [1]: *Zuni 
7 GULF [7]: 
a tTUNICA-CHITIMACHA [0]: 
i TCHITIMACHA [0]: *tChitimacha 
ii +ТОМІСА-АТАКАРА [0]: ** Tunica, *tAtakapa 
b YUKI-WAPPO [2]: Yuki, *Wappo 
c NATCHEZ-MUSKOGEAN [5]: 
і TNATCHEZ [0]: *tNatchez 
ii MUSKOGEAN [5]: 
о WESTERN [1]: *Choctaw 
B EASTERN [4]: 
[2]: *Mikasuki, *Muskogee (= Creek), tApalachee 
I ALABAMA-KOASATI [2]: “Alabama, *Koasati 
8 MEXICAN [39]: 
a HUAVE [1]: *Huave 
b TOTONACAN [2]: *Totonac, *Tepehua 
с MIXE-ZOQUE [6]: “Mike, Oluta, *Sayula, "Sierra Popoluca, 
Texixtepec, *Zoque 
d MAYAN [30]: 
i HUASTECAN [1]: *Huastec, tChicomuceltec 
ii YOCATECAN [4]: *Yucatec, Itza, Lacandon, *Mopan 
Hi GREATER TZELTALAN (Б): 
а CHOLAN [3]: *Chol, *Chontal, tCholti, *Chorti 
8 TZELTALAN [2]: *Tzeltal, *Tzotzil 
iv GREATER KANJOBALAN [6]: 
a CHUJEAN [2]: *Tojolabal, *Chuj 


8.6 Classification: Amerind 369 


В KANJOBALAN [4]: 
[1]: Мосһо 
I KANJOBAL-JACALTEC [3]: Kanjobal, *Acatec, 
*Jacaltec 
v EASTERN [14]: 
a GREATER MAMEAN [4]: 
I MAMEAN [2]: *Teco, “Мат 
II IXILAN [2]: *Aguacatec, *Ixil 
p GREATER QUICHEAN [10]: 
[2]: Uspantec, *Kekchi 
I QUICHEAN [6]: *Quiche, *Achi, Sacapultec, 
Sipacapa, *Tzutujil, *Cakchiquel 
II POCOM [2]: Pocomam, *Pocomchi 
C HOKAN [28]: 
1 NORTHERN [11]: 
[2]: *Karok, *Shasta, *tChimariko 
а РАГАІНМІНАМ [2]: *Achumawi, *Atsugewi 
b tYANAN [0]: *rYana, TYahi 
c POMO [7]: 
[1]: *Southeastern Pomo 
i RUSSIAN RIVER & EASTERN [6]: 
[1]: “Eastern Pomo 
о RUSSIAN RIVER [5]: 
[2]: *Northeastern Pomo, *Northern Pomo 
I SOUTHERN [3]: *Southern Pomo, *Central Pomo, 
“Қазһауа 
2 WASHO [1]: *Washo 
3 TSALINAN-CHUMASH [0]: TSalinan, *FChumash, tEsselen 
4 SERI-YUMAN (11): 
a SERI [1]: “бегі 
b YUMAN [10]: 
i KILIWA [1]: *Kiliwa 
ii DELTA-CALIFORNIAN [2]: *Cocopa, *Diegueno 
iii RIVER [3]: *Mohave, *Yuma, *Maricopa 
iv PAI [4]: 
о PAIPAI [1]: *Paipai 
В UPLAND YUMAN [3]: *Walapai, *Havasupai, *Yavapai 
5 TWAICURI-QUINIGUA [0]: tWaicuri, tMaratino, tQuinigua 
6 COAHUILTECAN [1]: 
[1]: *Tonkawa, tKarankawa 
a TCOAHUILTECAN PROPER [0]: *tCoahuiltec, TCotoname, 
*tComecrudo 
7 TEQUISTLATECAN [2]: *Tlamelula, *Tequistlatec 
8 SOUTHERN [2]: 
[1]: "Jicaque, tYurimangui 
a TLAPANECAN [1]: *Tlapanec, *tSubtiaba 
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H CENTRAL AMERIND [49]: 
А TANOAN [7]: 
1 KIOWA-TOWA [2]: *Kiowa, Towa 
2 TEWA-TIWA [5]: 
a TEWA [1]: *Tewa 
b TIWA-PIRO [4]: 
і PIRO [0]: *tPiro 
ii TIWA [4]: 
a NORTHERN [2]: *Taos, *Picuris 
В SOUTHERN [2]: *Isleta, "Sandia 
B UTO-AZTECAN [25]: 
1 NUMIC [8]: 
a WESTERN [2]: "Northern Paiute, "Mono 
b CENTRAL [3]: Koso, *Shoshone, *Comanche 
c SOUTHERN [3]: *Kawaiisu, "Southern Paiute, *Chemehuevi 
2 TÜBATULABAL [1]: *Tübatulabal 
3 TAKIC [4]: 
[0]: tGabrielino 
a SERRAN [1]: TKitanemuk, "Serrano 
b CUPAN [3]: 
[1]: "Luisefio 
i CAHUILLA-CUPENO [2]: *Cahuilla, *Cupeño 
4 HOPI [1]: “Норі 
5 PIMIC [4]: 
a PIMAN [2]: “Pima, “Pima Bajo, *tTepecano 
b TEPEHUAN [2]: "Northern Tepehuan, *Southern Tepehuan 
6 TARACAHITIC [3]: 
[0]: fTubar 
a TARAHUMARA-GUARI/ÍO [2]: *Tarahumara, Guarijío 
b CAHITA [1]: *Yaqui, tOpata, *tCahita 
7 CORACHOL [2]: *Cora, *Huichol 
8 AZTECAN [2]: 
a tPOCHUTLA [0]: *tPochutla 
b AZTEC PROPER [2]: *Pipil, *tClassical Nahuatl, *Nahuatl 
(= Aztec) 
C OTO-MANGUEAN [17]: 
1 AMUZGO [1]: *Amuzgo 
2 OTOMIAN [6]: 
a NORTHERN [2]: *Chichimeca, *Pame 
b CENTRAL [2]: *Mazahua, *Otomi 
c SOUTHERN [2]: Matlatzinca, Ocuiltec 
3 MIXTECAN [3]: 
a TRIQUE [1]: *Trique 
b MIXTEC-CUICATEC [2]: *Mixtec, *Cuicatec 
4 POPOLOCAN [4]: 
a MAZATEC [1]: *Mazatec 
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b ІХСАТЕС-СНОСНО [3]: 
i IXCATEC [1]: *Ixcatec 
ii СНОСНО-РОРО ОС [2]: Chocho, Popoloc 
5 CHINANTECAN [1]: *Chinantec 
6 ZAPOTECAN [2]: *Chatino, *Zapotec 
7 TMANGUEAN [0]: fMangue, *TChiapanec 
Ш CHIBCHAN-PAEZAN [43]: 
A CHIBCHAN [27]: 
[4]: *Tarascan, *tCuitlatec, *Xinca, *Lenca, Paya 
1 YANOMAM [4]: *Sanuma, *Yanomami, Yanomamó, Ninam 
2 NUCLEAR CHIBCHAN [19]: 
[2]: *Motilon, *Cuna 
a MISUMALPAN [2]: TMatagalpa, *Miskito, Sumo 
b RAMA [2]: Rama, Guatuso, TGuetar 
c TALAMANCA [4]: *Cabecar, *Bribri, *Teribe, *Borunca 
d GUAYMI [2]: *Guaymi, Buglere 
e TANTIOQUIA [0]: tKatio, TNutabe, tAnserma, TArma 
Ғ ARUAK [4]: tAtangue, TGuamaca, Яса, Cagaba, Chimila, Malayo 
g CHIBCHAN PROPER [3]: 
[0]: *FChibcha, TSinsiga, TDuit 
i TUNEBO [3]: Eastern Tunebo, *Central Tunebo, Western 
Tunebo 
B PAEZAN [16]: 
[2]: *tTimucua, *Warao, TKunza, TBetoi, *tChimu, “Itonama 
1 tHUARPE [0]: tAllentiac, TMillcayac 
2 TJIRAJARAN [0]: tAyoman, tGayon, +Јітајага 
3 MURA [1]: "Mura, tMatanawi 
4 NUCLEAR PAEZAN [13]: 
[0]: tAndaqui 
a CHOCO [8]: *Embera, *Saija, Catio, Caramanta, Chami, Tado, 
Baudo, *Waunana, tRuna 
b INTER-ANDINE [2]: 
i COCONUCAN [1]: *Guambiano, tCoconuco, tTotoro 
ii PAEZ [1]: *Paez 
ñi TPOPAYAN [0]: tPopayan 
c BARBACOAN [3]: 
i PASTO [1]: *Cuaiquer, tBarbacoas 
ii CAYAPA-COLORADO [2]: *Cayapa, *Colorado 
IV ANDEAN [18]: 
A tNORTHERN [0]: 
[0]: tCulli, tSec, tLeco 
1 tCATACAO [0]: +Саѓасао, ТСоһап 
2 TCHOLONA [0]: tCholona, tHibito 
B URARINA-WAORANI [2]: Urarina, "Waorani, TOmurano 
C CAHUAPANAN-ZAPAROAN [7]: 
1 CAHUAPANAN [2]: Chayahuita, Jebero 
2 ZAPAROAN [5]: “?араго, *Andoa, *Arabela, Cahuarano, "Iquito 
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D OUECHUAN [2]: *Quechua А, *Quechua В 
E AYMARAN [2]: *Aymara, *Jaqaru 
F SOUTHERN [5]: 
1 PUELCHE [0]: *tPuelche 
2 MAPUDUNGU [2]: *Mapudungu (= Araucanian), Huilliche 
3 QAWASQAR-YAMANA [2]: *Qawasqar (= Alacaluf), *Yamana 
4 PATAGONIAN [1]: *Tehuelche, tTeushen, *TSelknam, THaush 
V EQUATORIAL-TUCANOAN [192]: 
A MACRO-TUCANOAN [47]: 
[9]: tAuishiri, Canichana, Capishana, tGamella, *Iranshe, Koaia, 
*Movima, Muniche, Natu, tPankararu, Shukuru, Uman 
1 CATUQUINAN [2]: 
a NORTHERN [0]: tCatawishi 
b SOUTHERN [2]: tBendiapa, tBurua, *Canamari, Catuquina, 
TParawa, tTaware, tTucundiapa 
2 HUARI [2]: Huari, Masaca 
3 ARUTANI-SAPE [3]: Arutani, Sape, Macu 
4 NAMBIKUARAN [4]: Northern Nambikuara, *Southern Nambikuara, 
Sabanes, Sarare 
5 PUINAVE-MAKU [6]: 
[1]: *Puinave 
a MAKU [5]: *Hupda, Guariba, Майер, Yahup, “Саспа 
6 TICUNA-YURI [1]: *Ticuna, tYuri 
7 TUCANOAN [20]: 
[1]: Miriti 
a EASTERN [12]: 
i NORTHERN [3]: *Tucano, *Guanano, Piratapuyo 
ii CENTRAL [6]: 
a BARA [1]: Tuyuca 
В DESANO [2]: Desano, Siriano 
y TATUYO [3]: *Tatuyo, Carapana, Yuruti 
Hi SOUTHERN (3): 
a MACUNA [1]: Macuna 
B BARASANO [2]: “Northern Barasano, "Southern 
Barasano 
b CENTRAL [1]: *Cubeo 
c WESTERN [6]: 
[1]: Tanimuca 
i NORTHERN [4]: 
[2]: Coreguaje, Tetete, TTama 
a SIONA-SECOYA [2]: *Siona, *Secoya, tMacaguaje 
ii SOUTHERN [1]: Orejon 
B EQUATORIAL [145]: 
[4]: “Сауауауа, *Camsa, tTaruma, *Trumai, tTusha, *Yuracare 
1 PIAROA-SALIBA [2]: Piaroa, Saliba 
2 TTIMOTE [0]: tCuica, tTimote 
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3 JIVAROAN (8): 
[2]: *Cofan, tEsmeralda, *Yaruro 
a CANDOSHI [2]: *Candoshi, Taushiro 
b JIVARO PROPER [4]: Achuar, Shuar, *Huambisa, *Aguaruna 
4 ZAMUCOAN [2]: *Ayoreo, Chamacoco 
5 KARIRI-TUPI [46]: 
a TKARIRI (0): tOzubucua, tKamaru, +Кігігі, TSapuya 
b TUPI [46]: 
[2]: Purubora, Yuqui 
i ARIKEM [1]: tArikem, tKabishiana, Karitiana 
ii MONDE [4]: Arua, Gaviáo, Surui, Monde, tDigüt 
iii RAMARAMA [1]: Arara, tRamarama, tUruku, tUrumi 
iv YURUNA [1]: Yuruna, tManitsawa, tShipaya 
v ТОРАК! [5]: *Guarategaya, Kepkiriwat, Makurapi, Tupari, 
Wayoro 
vi TUPI-GUARANI [32]: 
[2]: Guayaki, Juma 
о GROUP I [25]: 
[9]: Apiaca, Aweti, Ava, Kamayura, Kayabi, Emerillon, 
TKaripuna, Oyampi, Pauserna, tSheta, tTakunape, 
Tapirape 
I KAWAHIB [3]: tParanawat, *Tenharim, Tukumanfed, 
Wirafed 
П TENETEHARA [8]: Amanaye, tAnambe, Tembe, 
*Guajajara, Guaja, *Аѕигіпі, Arawete, Parakana, 
Urubu 
Ш TUPI [5]: *Guarani, *Kaiwa, Tapiete, Guarayu, 
*Nhengatu, tPotiguara 
B GROUP II [2]: *Cocama, Omagua 
ү GROUP Ш [3]: Mawe, *Munduruku, *Siriono, tKuruaya 
6 MACRO-ARAWAKAN (83): 
[1]: Katembri, tOtomaco 
a TINIGUAN [1]: Tinigua, tPamigua 
b GUAHIBAN [5]: tChuruya, *Guahibo, *Cuiba, Guayabero, 
Macaguan, Playero 
c ARAWAKAN [76]: 
[1]: tGuamo, tTaino, Chamicuro 
i ARAWAN [7]: tArawa, Culina, tPama, Paumari, Sewacu, 
tSipo, Deni, Yamamadi, Yuberi, Banawa 
ii CHIPAYAN [3]: *Chipaya, Uru, Callahuaya, tCaranga 
іп CHAPACURAN [11]: 
а GUAPORE [8]: 
I CHAPACURA [2]: Chapacura, Itene, tQuitemoca 
II WANYAM [6]: Cabishi, Cujuna, Cumana, Mataua, 
Wanyam, Urunamacan 
B MADEIRA [3]: Pakaasnovos, Tora, Urupa 
y TOCORONO [0]: tHerisobocono, tOcorono, tSan Ignacio 
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iv MAIPURAN [54]: 
[0]: tShebayo 
a BANIVA-YAVITERO [1]: tBaniva, Yavitero 
В PARESSI-SARAVECA [1]: *Paressi, tSaraveca 
y PRE-ANDINE [15]: 
[1]: *Amuesha 
І tAMAZONIAN [0]: tMarawa, tWaraicu 
П CUTINANA [1]: Cushichineri, tCuniba, tCutinana 
III JURUA-PURUS (2): Casharari, tCatiana, tInapari, 
Apurina, tManiteneri, TWainamari 
IV MONTANA [11]: 
[3]: *Piro, tPucapacuri, Puncuri, Sirimeri 
A CAMPA [6]: *Ashaninca, Asheninca, Pajonal, 
Caquinte, *Machiguenga, *Nomatsiguenga 
B HARAKMBET [2]: *Amarakaeri, Huachipaeri 
3 WAPISHANAN [1]: “Wapishana 
е SOUTHERN [9]: 
I PARANA [3]: Chane, Guana, *Terena 
H BOLIVIAN [6]: *Baure, Cashiniti, Chiquito, Cozarini, 
*Ignaciano, Trinitario 
¢ EASTERN (Б): 
[1]: Palicur 
I XINGU [4]: Custenau, Yawalapiti, Mehinaku, Waura 
т NORTHERN [22]: 
[2]: "Island Carib, TArua, tJaoi, *Resigaro 
I CAQUETION [1]: Achagua, TCaquetio 
H GUAYUPEAN (0): TSae 
III ORINOCO [3]: Guarequena, Bare, Guinao, 
tMawacua, tMaipure 
IV PIAPOCAN [1]: Piapoco 
V TA-ARAWAKAN [3]: *Goajiro, TGuanebucan, 
* Arawak, Paraujano 
VI RIO NEGRO [12]: 
A IZANENI [7]: Baniwa, Curripaco, Pacu, Cuati, 
Mapanai, Moriwene, Izaneni 
B MARITIPARANA [2]: Cabiyari, tMatapi, 
*Yucuna 
С WIRINA [1]: tWirina, Yabaana 
D tTARIANAN [0]: fItayaine, +Тагіапо 
E TYAPURA A [0]: tMariate, fWainuma 
F YAPURA B [2]: Cawishana, Yumana, tPasse, 
TManao 
VI GE-PANO-CARIB [117]: 
A MACRO-CARIB [47]: 
[1]: *Andoke, tCucura 
1 PEBA-YAGUAN [1]: tPeba, TYameo, *Yagua 
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2 BORA-WITOTO [9]: 
a ВОКАМ [4]: "Bora, Faai, Imihita, *Muinane 
b WITOTOAN [5]: 
[3]: Fitita, “Осаіпа, Nonuya, tOrejone, tMiranya, tCoeruna 
i WITOTO PROPER [2]: *Meneca, *Murui 
3 CARIB [36]: 
a NORTHERN [28]: 
i COASTAL [2]: 
о TVENEZUELAN [0]: tChayma, tCumanagoto, tYao, 
tTamanaco 
В SIERRA DE PERIJA [2]: Japreria, *Yukpa 
y tOPONE-CARARE [0]: tOpone, tCarare 
ii WESTERN GUIANA [3]: Mapoyo, Yabarana, Panare, tQuaca, 
tPareca 
Hi GALIBI [1]: *Galibi (= Carib) 
iv EAST-WEST GUIANA [21]: 
a GROUP 1 [3]: *Wayana, *Apalai, TRoucouyene, tAracaju, 
*Trio 
В GROUP 2 [4]: Wama, Urukuyana, Triometesen, Kumayena 
y GROUP 3 (Б): Pianakoto, Saluma, tPaushi, Cashuena, 
Chikena, *Waiwai 
ó GROUP 4 [5]: tParavilhana, Wabui, Sapara, Yauapery, 
Waimiri, tCrichana, Paushiana, tBonari 
е GROUP 5 [4]: *Makushi, *Pemon, Patamona, Akawaio, 
tArinagoto 
v NORTHERN BRAZIL [1]: 
a TGROUP 1 [0]: TPalmella, tPimenteira 
В GROUP 2 [1]: fYaruma, Chicão 
y tGROUP 3 [0]: tPariri, tApiaka, tArara, TYuma 
b SOUTHERN [8]: 
i SOUTHEASTERN COLOMBIA [1]: tHianacoto, tGuaque, 
Carijona 
ii XINGU BASIN [2]: *Bakairi, Nahukwa 
iii SOUTHERN GUIANA [5]: Ye’cuana, Wayumara, Parukoto, 
*Hishkaryana, *Kashuyana 
B GE-PANO [70]: 
1 MACRO-PANOAN [49]: 
а TCHARRUAN [0]: tCharrua, tChana, tGüenoa 
b tLULE-VILELA [0]: tLule, *tVilela 
с MASCOIAN [5]: Guana, Northern Lengua, Southern Lengua, 
Sanapana, Emok 
d MOSETEN [1]: *Chimane 
e MATACO-GUAICURU [9]: 
i GUAICURUAN [4]: tAbipon, Kadiweu, tGuaicuru, tGuachi, 
Mocovi, *Toba, Pilaga 
ii MATACO [5]: *Chulupi, “Chorote, Nocten, *Vejoz, Maca 


376 8 CLASSIFICATION OF THE WORLD'S LANGUAGES 


f PANO-TACANA [34]: 
i PANOAN [25]: 
о SOUTHERN [3]: *Chacobo, Pacahuara, Caripuna, 
TSinabo, TZurina 
8 SOUTH-CENTRAL [5]: 
[2]: *Amahuaca, Parquenahua 
I YAMINAHUA-SHARANAHUA [3]: Yaminahua, 
Poyanawa, TShipinawa, TTushinawa, *Sharanahua 
y SOUTH-EASTERN [5]: *Cashinahua, Morunahua, 
Catuquina, Shahuanahua, Camanahua 
Š WESTERN [1]: *Cashibo 
e NORTH-CENTRAL [6]: *Capanahua, *Shipibo, 
Isconahua, TRemo, Marubo, TCapuibo, Sensi, Niarawa, 
t+Pichobo, tAraua, TAtsahuaca 
¢ NORTHERN [4]: *Matses, Mayubo, Rémoxbo, Pisabo 
т EASTERN [1]: Kasharari 
ii TACANAN [9]: 
[1]: Yamaluba, tArasa, tGuariza 
a tCHIRIGUA [0]: tMaropa, tChumana, tSapiboca 
8 TIATINAGUA-TACANA [8]: 
I TIATINAGUA [2]: *Ese'ejja, "Huarayo 
II ARAONA-TACANA [6]: 
A ARAONA [3]: Capachene, Mabenaro, Araona 
B CAVINENA-TACANA [3]: 
1 CAVINENA [1]: *Cavinefia 
2 TACANA PROPER [2]: “Тасапа, Reyesano, 
TToromono 
2 MACRO-GE [21]: 
[5]: Rikbaktsa, Fulnio, Guato, Opaye, tOti, Caraja 
a BORORO [3]: 
i BORORO PROPER [2]: “Вогого, Umotina, tAravira 
ii OTUKE [1]: Otuke, TCovare, tCoraveca, TCurucane, tTapii 
b BOTOCUDO [1]: Botocudo 
c TCHIQUITO [0]: tChiquito 
d TKAMAKAN [0]: fKutasho, tMasacara, tMenien, tMongoyo, 
TKamakan 
e tPURI [0]: tPuri, tCoroado, tCoropo 
f YABUTI [2]: Aricapu, Yabuti 
g MASHAKALI [1]: *Mashakali 
h GE-KAINGANG [9]: 
i KAINGANG [2]: 
[0]: tDorin 
a NORTHERN [2]: "Kaingang, Xokleng, tChiqui 
p SOUTHERN [0]: tAmho, fIngain, tGualachi 
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ii GE [7]: 
а CENTRAL [2]: 
I TACROA (0): tAcroa, tAricobe, TGuegue 
П ACUA [2]: *Chavante, Cherente 
В NORTHWEST [5]: *Apinaye, Suya, Kreen-Akarore, 
“Сауаро, *Canela 


LANGUAGEISOLATES’ [5]: "Basque, *Burushaski, “Ket, *Gilyak, *Nahali, 
*tSumerian, *tEtruscan, *tHurrian, tMeroitic 


UNCLASSIFIED‘ [16]: Warenbori (New Guinea), Талгар (New Guinea), Yuri 
(New Guinea), Busa (New Guinea), Nagatman (New Guinea), Porome (New 
Guinea), Pauwi (New Guinea), Massep (New Guinea), Carabayo (Colombia), 
Guaviare (Colombia), Yari (Colombia), Mutus (Venezuela), Yuwana (Venezuela), 
Kohoroxitari (Brazil), Arara (Brazil), Chiquitano (Bolivia) 


PIDGINS AND CREOLES [37]: 

I SWAHILI-BASED [1]: Asian Swahili 
H ZULU-BASED [1]: *Fanagalo 
III ENGLISH-BASED [17]: 

А ATLANTIC [10]: “Sea Island Creole (= Gulla), Guyanese, "Western 
Caribbean Creole, Lesser Antillean Creole, *Krio, Wescos, *Djuka, 
*Saramaccan, *Sranan, Pitcairnese 

В PACIFIC [7]: Hawaiian Creole, Roper River Creole, Torres Strait Pidgin, 
*Tok Pisin, *Bislama, Solomon Pijin, Micronesian Pidgin 

IV DUTCH-BASED [1]: Dutch Creole 
V GERMAN-BASED [1]: Unserdeutsch 
VI FRENCH-BASED [1]: *French Creole 
VII SPANISH-BASED [3]: Bamboo Spanish, *Chavacano, Palenquero 
VIII PORTUGUESE-BASED [8]: *Indo-Portuguese, Macanese, *Malacca Creole, 
Timor Pidgin, *Crioulo, *Fa d'Ambu, Papiamento, Cafundo Creole 


«Тһе difference between a language isolate and an unclassified language is the following. 
A language isolate is a language with a reasonable amount of documentation that has been 
evaluated by scholars for a sufficient period of time to know that the language is not closely 
related to any other known language or group. An unclassified language is typically that of a 
recently discovered ethnic group. In such cases little or nothing is known of the language of 
these people, or what is known has not yet come to the attention of someone who could clas- 
sify it on this basis. It is likely that the unclassified New Guinea languages are members of 
Indo-Pacific; almost certainly the unclassified South American Indian languages are members 
of one of the South American branches of Amerind. This list of unclassified languages merely 
represents those that have come to my attention. There are still uncontacted tribes in South 
America, and peoples whose distinct languages have escaped notice elsewhere in the world, 
especially in Southeast Asia, New Guinea, and Africa. 
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IX RUSSIAN-BASED [0]: TRusso-Norsk 
X NAGA-BASED [1]: “Мара Pidgin 
XI MALAY-BASED [1]: Bazaar Malay 
XII MOTU-BASED [1]: *Hiri Motu 
XIII CHINOOK-BASED [1]: *Chinook Jargon 


INVENTED [2]: "Esperanto, "Interlingua 


Postscript 1991 


During the decade that I compiled апа refined the classification given іп 
this book, no one was more aware of its constant state of flux than L. Nor did 
I imagine that the classification frozen in type in 1987 (and in fact finalized 
іп 1985) would be immune to further change—at every level of the classifi- 
cation. This is not to say that I thought everything was still up for grabs, but 
rather that some reshuffling and further consolidation were by no means 
ruled out. I did not, however, anticipate the significant and unexpected de- 
velopments that have in fact taken place during the past five years. This 
Postscript will then bring things up to date, but the rest of the book is 
unchanged. 

In assembling the classification given in this book, my primary intention 
was to define linguistically valid taxa (i.e., families of languages more 
closely related to each other than to any language outside the family) in a 
hierarchical structure usually expressed in the form of a tree. Above all else 
I wished to avoid putting together languages (and language families) that 
did not really constitute valid taxa and thus do not lead to valid historical 
inferences. At the highest level I would now make two changes. The family 
called Caucasian in section 2.5 should be broken up into two distinct fami- 
lies, North Caucasian and South Caucasian, each of which is more closely 
related to other families than to the other, as discussed below. Following a 
suggestion by Sergei Starostin I now use Caucasian in place of North Cau- 
casian, and Kartvelian in place of South Caucasian. Furthermore, I would 
now remove Korean-Japanese-Ainu from Altaic, leaving Altaic with its tra- 
ditional constituents: Turkic, Mongolian, and Tungus. These taxonomic re- 
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visions raise the number of supposedly independent stocks to 20, shown in 
List 9.1, as well as a few language isolates like Basque, Burushaski, Gilyak, 
TEtruscan, апа tSumerian. 


List 9.1. The World's Language Families (1987) 


KHOISAN SINO-TIBETAN 
NIGER-KORDOFANIAN: YENISEIAN 
I KORDOFANIAN NA-DENE 
II NIGER-CONGO AFRO-ASIATIC 
NILO-SAHARAN KARTVELIAN 
AUSTRALIAN ELAMO-DRAVIDIAN 
INDO-PACIFIC INDO-HITTITE 
AUSTRIC: URALIC-YUKAGHIR 
I AUSTROASIATIC ALTAIC 
II MIAO-YAO KOREAN-JAPANESE-AINU 
Ш DAIC CHUKCHI-KAMCHATKAN 
IV AUSTRONESIAN ESKIMO-ALEUT 
(NORTH) CAUCASIAN AMERIND 


In section 7.3 I briefly discussed several proposals for more comprehen- 
sive classifications that would reduce the number of independent families 
in List 9.1, and I also raised the possibility that all human languages may 
ultimately share a common origin, though I emphasized that such ques- 
tions have long been considered unapproachable by traditional historical 
linguists. Over the past five years a radically different view—but one with 
clear precursors in the late nineteenth and early twentieth centuries—has 
come to be advocated by a small but growing number of scholars who be- 
lieve not only that all presently extant languages are related, but that this 
can be shown by traditional comparative linguistics. A summary of recent 
work from this perspective may be found in Ruhlen (1991а). 


LONG-RANGE COMPARISON 


It is not possible to understand why some linguists see genetic diversity 
where others see uniformity without approaching the problem historically. 
Today the vast majority of linguists, in this country at least, subscribe to the 
view that Indo-Hittite has never been shown to be related to any other lan- 
guage or language family. The explanation usually offered for this total lack 
of relatives is that the comparative method in linguistics is limited to the 
last 5,000--10,000 years, beyond which time phonetic and semantic changes 
will have obliterated all trace of genetic affinity—if such ever existed. 


Postscript 1991 381 


There аге, however, а number of things wrong with this traditional апа 
widely accepted view of the limitations of comparative linguistics. First of 
all, it would indeed be a remarkable coincidence if the Indo-Hittite family 
turned out not only to be the family in terms of which comparative lin- 
guistics was first discovered some 200 years ago, but also the measure of 
the temporal limits of the method itself. The blind acceptance of this belief 
in the twentieth century has led to a view of classification wherein the 
world is conceived of as if it were made up of numerous Indo-Hittite-sized 
families, among which there are simply no apparent connections. This pos- 
sibility cannot be ruled out a priori, for it is theoretically possible that, while 
Indo-Hittite is only 6,000—8,000 years old, the temporal gap between Indo- 
Hittite and its closest relative might be one or two million years. This is 
precisely the traditional belief—except that the date of necessary diver- 
gence is set at ten thousand years or more, not at two million. 

What is really wrong with the traditional view is that it is simply false. 
Consider, for example, Proto-Indo-European *nepét- ‘nephew,’ which 
hardly differs from modern Rumanian nepot ‘nephew.’ This and many other 
similar examples that could be adduced show clearly that the idea that 
everything has changed beyond recognition after 6,000 years cannot be 
taken seriously. Furthermore, it was already known to be false in the nine- 
teenth century, when many genetic connections were discovered among 
Eurasian families by linguists of no less repute than Rasmus Rask. At the 
beginning of the twentieth century, scholars such as Henry Sweet, Holger 
Pedersen, and Alfredo Trombetti pointed out the inconsistency of Indo- 
Europeanists treating the Indo-European pronoun system, m- T vs. t- ‘you’ 
(sg.), as a pattern inherited from Proto-Indo-European, while conven- 
iently ignoring the presence of a similar pattern in Uralic, Altaic, Chukchi- 
Kamchatkan, and Eskimo-Aleut. The interpretation of the same pattern in 
different ways according to one's preconceived expectations and assump- 
tions can hardly be called scientific. Toward the close of the twentieth 
century, scholars such as Vladislav Illich-Svitych, Aron Dolgopolsky, and 
Joseph Greenberg have similarly stressed the distribution of this pro- 
nominal pattern and its genetic import. Despite the constant dissent of a 
few linguists throughout the twentieth century, Indo-Europeanists have in 
general maintained vehemently, down to the present day, the fiction that 
Indo-Hittite has no known relative. They have done this, I believe, for a 
number of reasons (Ruhlen 1988a, Bengtson 1988), some of which Sweet 
(1901: vi) pointed out almost a century ago: 


In philology, as in all branches of knowledge, it is the specialist who most strenu- 
ously opposes any attempt to widen the field of his methods. Hence the advocate of 
affinity between the Aryan [Indo-European] and the Finnish [= Finno-Ugric] lan- 
guages need not be alarmed when he hears that the majority of Aryan philologists 
reject the hypothesis. In many cases this rejection merely means that our specialist 
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has his hands full already, and shrinks from learning a new set of languages. . . . 
Even when this passively agnostic attitude develops into aggressive antagonism, it 
is generally little more than the expression of mere prejudice against dethroning 
Aryan from its proud isolation and affiliating it to the languages of yellow races; or 
want of imagination and power of realizing an earlier morphological stage of Aryan; 
or, lastly, that conservatism and caution which would rather miss a brilliant discov- 
ery than run the risk of having mistakes exposed. 


Furthermore, many of the early proposals—but by no means all—were 
quite bad and were justifiably rejected by Indo-Europeanists, in particular 
certain comparisons with Semitic that were based mostly on typological 
factors and on the earnest hope that the languages of Christian Europe 
were related to the languages of the Bible. An additional factor has been 
the ever-increasing specialization in academia throughout the twentieth 
century, which has encouraged scholars to occupy a narrow and cozy 
niche—usually a single language family or even a single language for lin- 
guists—to which all their attention is devoted. Under these circumstances, 
widely distributed traits are not likely even to be noticed, much less un- 
derstood. An entirely analogous situation befuddled the classification of 
American Indian languages throughout this century, as discussed in sec- 
tion 6.3.1. 

Another factor that has prevented more comprehensive taxonomies 
from developing has been a confusion between classification and reconstruc- 
Hon, the latter usually requiring regular phonological correspondences. 
Thus, Sarah Thomason (1990) claims that "the traditional proof of genetic 
relationship . . . is the discovery of recurring sound/meaning correspon- 
dences that permit phonological reconstruction, together with the estab- 
lishment of systematic correspondences in other grammatical subsystems 
that permit grammatical reconstruction." In a similar vein Theodora Bynon 
(1977: 271) claims that Greenberg's "use of basic vocabulary comparison not 
simply as a preliminary to reconstruction but as a substitute for it is... 
controversial." This is a surprising statement, since Greenberg has never 
claimed—and never would claim—that classification is a "substitute" for 
reconstruction. The two are distinct enterprises, and classification does not 
require reconstruction. In fact, just the opposite is true, in that reconstruc- 
tion does require a prior classification; otherwise there would be nothing to 
reconstruct. 

There is a final element in the question of "long-range comparison" that 
needs to be dispelled. Many linguists believe that Greenberg uses uncon- 
ventional, even intuitive, methods that are quite different from the meth- 
odology of traditional historical linguistics, which is usually equated with 
reconstruction and regular sound changes. This view is clearly evident in 
the remarks of Thomason and Bynon above and has recently appeared 
widely in the media in this country. Yet nothing could be further from the 
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truth. The reasons for this widespread misconception are, | believe, the fol- 
lowing. Historical linguistics has always been dominated by specialists in 
Indo-Hittite, for obvious historical reasons. But for Indo-Hittite the prob- 
lem of classification was essentially resolved by Sir William Jones in 1786 
when һе outlined five branches of this family. Thus for Indo-Hittite, classi- 
fication has never been a central problem, though a few new branches were 
later discovered. 

When, however, one is faced not with an already defined family like 
Indo-Hittite, but with a thousand African languages whose relationships 
are unknown, the problems are quite different. At this stage one must 
begin with the comparison of the languages involved in order to detect 
closely related families, and then, by the comparison of these, higher-level 
taxa. Greenberg has called this process “multilateral comparison,” but it is 
really no different from the initial step of the comparative method. As we 
saw in Chapter 6, Greenberg's method of classification “by inspection” has 
been derided by American Indianists who reject the idea that such a simple 
method could produce reliable results (Goddard 1987: 656). This, however, 
is not the view of the eminent evolutionary biologist Ernst Mayr (1988: 268): 
“From the earliest preliterary days, organisms were grouped into classes 
by their outward appearance, into grasses, birds, butterflies, snails, and 
others. Such grouping ‘by inspection’ is the expressly stated or unspoken 
starting point of virtually all systems of classification." The problem with 
historical linguistics is that this initial step in the comparative method has 
always been “unspoken,” and Greenberg’s attachment of a new name to 
this process has led some to believe he is employing a new and unreliable 
method. In fact, he is simply starting the only way one can. 

In the final analysis, of course, whether or not Indo-Hittite can be shown 
to be related to other families depends not on a priori methodological con- 
siderations, but on the empirical evidence that can be adduced in favor of 
such relationships. Let us turn then to a review of the evidence that has 
been put forward by the advocates of a Eurasiatic, or Nostratic, family in 
which Indo-Hittite is but one component. 


Nostratic/ Eurasiatic 


The Nostratic family, as defined by Illich-Svitych, connects Indo-Hittite 
with five other families, whereas Greenberg's Eurasiatic family overlaps 
this definition, as seen in Fig. 9.1. The Nostratic family is supported by 
over 700 etymologies (Illich-Svitych 1967, 1971-84; Dolgopolsky 1984, to 
appear), and Greenberg (to appear) offers about 500 etymologies in sup- 
port of Eurasiatic. Finally, Bomhard (1990) proposed 483 Nostratic ety- 
mologies, though he argues for a different set of sound correspondences 
from those used by the Soviets. 
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Afro-Asiatic 

Kartvelian 

Elamo-Dravidian 
NOSTRATIC ( Indo-Hittite 

Uralic-Yukaghir 

Altaic 

Korean 

Japanese EURASIATIC 

Ainu 

Gilyak 

Chukchi-Kamchatkan 

Eskimo-Aleut 


Fig. 9.1. Comparison of Illich-Svitych's Nostratic and Greenberg's Eurasiatic 


The differences between Nostratic and Eurasiatic derive from the different 
methodologies used by the Nostraticists and by Greenberg. Since its first 
adumbration by Holger Pedersen at the start of this century, Nostratic has 
been defined—somewhat amorphously—-as those language families that 
can be shown to be related to Indo-Hittite. Furthermore, Nostraticists insist 
on comparing reconstructed proto-languages for each family and thus ex- 
clude, for the time being anyway, those families that have not yet been re- 
constructed in sufficient detail for broader comparison. Two points should 
be made. First, this definition of Nostratic is unabashedly Eurocentric and 
really is not a proper taxonomic definition at all. Second, a taxonomic defi- 
nition that depends on the quality of the linguistic literature stands little 
chance of constituting a valid taxon. 

Greenberg, on the other hand, approaches the problem of Indo-Hittite 
relatives from a totally different perspective, that of a taxonomist who 
merely classifies the entities under consideration, but without requiring 
reconstruction and without attributing any special importance to Indo- 
Hittite. Furthermore, Greenberg does not deny that Afro-Asiatic, Kart- 
velian, and Dravidian are related to the families he includes in Eurasiatic, 
but he considers these relationships more remote. With the growing aware- 
ness of the importance of classification— prior to reconstruction—by at 
least some Nostraticists, the differences between Eurasiatic and Nostratic 
show signs of reconciliation. With regard to Afro-Asiatic there seems to be 
a growing consensus that this family is related to Nostratic as a sister lan- 
guage rather than as a daughter language, as Greenberg and Harold Flem- 
ing had already proposed. Recently Starostin (1989a) has expressed the 
same view: "I prefer presently to exclude Afro-Asiatic material from the 
Nostratic comparisons. This does not mean, of course, that Afro-Asiatic 
parallels for Nostratic roots . . . are all accidental ‘look-alikes.’ They may 
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purely reflect a relationship on some deeper level between Afro-Asiatic and 
Nostratic." Furthermore, in the same paper Starostin presents substantial 
and convincing evidence connecting Nostratic and Dene-Caucasian (see 
below), but he does not claim that the Dene-Caucasian families are Nostratic 
(as the traditional definition would require). He sees rather that they are two 
higher-level families that are related to each other and to Afro-Asiatic. Fi- 
nally, Starostin (1990) suggests that Dravidian is the most divergent branch 
of Nostratic (minus Afro-Asiatic), thus bringing his views further in line 
with those of Greenberg. Starostin's perspective is similar to that of Green- 
berg in that it concentrates on defining the hierarchical relationships of the 
world's language families rather than simply trying to prove that X is re- 
lated to Y. 

With regard to the Asian families that Greenberg includes in Eurasiatic— 
but which have been generally excluded from Nostratic—there is really 
little conflict. These families have all been included in Nostratic by one or 
another scholar, and Dolgopolsky has included Eskimo-Aleut in Nostratic 
from his earliest work. Thus, these families have generally been excluded 
from Nostratic because they had not been reconstructed, not because they 
did not belong. This leaves Kartvelian as the sole remaining difference be- 
tween Eurasiatic and Nostratic, and it seems certain that this difference too 
can eventually be resolved. 


Amerind 


The publication of Greenberg's (1987) revolutionary classification of 
American Indian languages into just three families provoked a storm of 
controversy that parallels in many ways that which greeted his African 
classification some forty years ago. Even before the appearance of Green- 
berg's book—and without even seeing its contents—Lyle Campbell (1986) 
called for Greenberg's classification to be "shouted down." Under such cir- 
cumstances it is not surprising that Campbell later delivered a thoroughly 
negative review of Greenberg's book to the journal Language (Campbell 
1988). Terrence Kaufman (1990: 63) claims that “many of Greenberg’s (1987) 
and Ruhlen's (1987) [this book] avowed values are subversive and should 
be explicitly argued against." Ives Goddard (1987: 657) gave an equally 
negative review of Greenberg's book, claiming that “the equations it gener- 
ates do not require any historical explanation." A minority of scholars, 
however, recognized the importance of Greenberg's work from the out- 
set, and Dell Hymes's (1987: 659) review stands in stark contrast to the as- 
sessments of Campbell, Kaufman, and Goddard: "Greenberg has demon- 
strated what has been suspected, namely, that almost all the indigenous 
languages of the New World are genetically related as what he appropri- 
ately calls 'Amerind.' So much, I think, is certain." 
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My own study of Greenberg’s book, and of the 23 volumes of notebooks 
upon which it is based (Greenberg 1981), has led me to conclude that the 
case for Amerind is even stronger than Greenberg's book implied, which is 
hardly surprising for a pioneering work of this nature. First, there are 
about 150 additional Amerind etymologies that Greenberg split among 
various Amerind subgroups instead of uniting them as Amerind etymolo- 
gies (Ruhlen 1989a). Second, the unpublished Amerind notebooks contain 
additional Amerind etymologies, as well as substantial additional evidence 
for those already listed in the book. Finally, I have found that claims (Thoma- 
son 1990) that the Amerind pronominal pattern n-/m- 'I/you' is an archaic 
residue of stable nasal sounds is demonstrably false. As shown in Ruhlen 
(1989b), this pattern is virtually absent outside of the Amerind family, 
though within Amerind it is the most frequent pronominal pattern, and its 
wide distribution in the Americas has been noted for almost a century. 

In light of what has been said in section 6.3.1 concerning the taxonomic 
nonsense advocated by many American Indian specialists, it is hardly sur- 
prising that the practitioners of the failed Phase III paradigm should react 
hostilely to its demise. What is surprising is the extent to which these critics 
have indulged in outright distortions of Greenberg’s book, charges that are 
answered in detail in Ruhlen (1988b) and Greenberg (1989, 1990a, 1990b). 
Since many scholars read reviews of books they would never actually look 
at, no doubt some such scholars have accepted what Greenberg's critics 
have said in their reviews at face value, without ever checking Greenberg's 
book. It is for that reason that a few examples of this criticism are in order. 

In his review in Language, Campbell (1988: 594-95) discusses the "Xinca- 
Lenca problem" (i.e. are these two languages related?): "An example of one 
of these often-repeated long-shot proposals is instructive. The Xinca-Lenca 
hypothesis was first proposed by Lehmann in 1920 . . . and accepted by 
Sapir; it became entrenched in the literature and is reflected in Language in 
the Americas as part of Greenberg's Chibchan. Nevertheless, Lehmann's evi- 
dence is the only data ever published in support of the hypothesis [emphasis 
mine]. | assess his evidence in its entirety. . . . In sum, not one single cog- 
nate set proposed by Lehmann is without serious problems; the Xinca- 
Lenca hypothesis thus has no support." Campbell apparently is trying to 
create the impression that Greenberg's book is little more than a stringing 
together of previously ill-supported groupings like Xinca-Lenca. Anyone 
who takes the time to read Greenberg's book will find that Xinca and Lenca 
figure in no less than 78 of Greenberg's lexical etymologies for Chibchan- 
Paezan and Amerind. Thus Campbell’s claim that the only evidence con- 
necting these two languages is Lehmann’s is simply false. 

Goddard (1987: 657) criticizes Greenberg for loose semantics, meaning 
that the semantic connections are implausible. As an example he cites 
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Greenberg's grouping of words meaning “excrement,” "night," and “grass” 
in a single etymology. Most linguists would agree that these three mean- 
ings have little in common. However, an examination of the etymology it- 
self reveals Goddard's trick, for what he has done is to cite the furthest se- 
mantic connections in the etymology while completely excluding the basic 
core meanings. What we find in the etymology in question are words de- 
noting “dark in color." In Almosan the meaning is "black" and the overall 
distribution of the meanings in North and South America suggests that this 
was the original meaning. In Keresiouan we find “dark in color” in Iro- 
quoian and "green" in Keresan. In Penutian the meaning has completely 
shifted to "green" and its close semantic connections “grass” and "blue." 
In South America Macro-Ge preserves the original meaning of “black” in 
Proto-Ge, while other languages in this group have shifted the meaning to 
"dirty." Finally, in the Equatorial group the meaning is "excrement." As in 
the previous example, a comparison of Greenberg's book with the allega- 
tions of his critics reveals what can only be called a willful distortion of the 
evidence. It should be noted that Campbell (1988: 600) also criticizes Green- 
berg’s lax semantic standards: “G[reenberg]’s forms are quite permissive in 
semantic latitude. Semantic equations such as the following are not con- 
vincing: 'excrement/night/grass.' ” 

While critics have spent most of their time trying to discredit Green- 
berg's classification, others have begun to compare Greenberg/s three fami- 
lies with the world's other language families in order to identify the origin 
of each. Already in 1986 Greenberg, Turner, and Zegura suggested, in an 
article comparing dental, serological, and linguistic data, that there were 
three migrations from Asia to the New World, the first resulting in the 
Amerind family, the second in the Na-Dene family, and the third, the 
Eskimo-Aleut family. Powers (1990) provides archaeological evidence for 
three distinct migrations to the New World, at roughly 14,000 B.P., 11,000 
В.Р., and 4,000 В.Р. Linguistic evidence for three distinct migrations, which 
would require a comparison of these three families with Old World fami- 
lies, was not given in the Greenberg, Turner, and Zegura article, but has 
subsequently been developed for all three migrations. Eskimo-Aleut, as the 
easternmost member of Eurasiatic, probably represents the most recent mi- 
gration. The next most recent incursion into the Americas was carried out 
by the ancestors of the Na-Dene, which was shown during the 1980s to be 
related to a different set of Old World families in a phylum now called 
Dene-Caucasian (see below). Finally, the Amerind phylum itself, which 
constituted the first migration sometime before 12,000 B.P., is most closely 
related to Eurasiatic as a whole. That is, though it is not a member of 
Eurasiatic, it is closer to Eurasiatic than to any other family. Linguistic evi- 
dence for this connection may be found in Ruhlen (1989e) and is summa- 
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rized іп Ruhlen (1990а). Ап investigation of the internal structure of the 
Amerind family is given in Ruhlen (1991b). 


Dene-Caucasian 


During the early part of this century Edward Sapir, after making a string 
of brilliant taxonomic discoveries (discussed in Chapter 6), came to the con- 
clusion that the Na-Dene Family, which he himself had identified, was 
more closely related to the Sino-Tibetan family than to other native Ameri- 
can languages. In a letter to Alfred Kroeber in 1920 he wrote: "I am just 
now interested in another big linguistic possibility. I tremble to speak of it, 
though I've carried the germinal idea with me for years. I do not feel that 
Na-dene belongs to the other American languages. . . . Do not think me 
an ass if I am seriously entertaining the notion of an old Indo-Chinese 
[7Sino-Tibetan] offshoot into N.W. America. . . . At least I know that 
Déné's a long shot nearer to Tibetan than to Siouan" (Golla 1984: 350). Ex- 
actly one year later he wrote to Kroeber again regarding the same subject, 
this time even more emphatic than before: 


I have long wanted to write you about Nadene and Indo-Chinese, but my evidence 
accumulates so fast it is hard to sit down and give an idea. Let me say this for the 
present. If the morphological and lexical accord which I find on every hand be- 
tween Nadene and Indo-Chinese is “accidental,” then every analogy on God's 
earth is an accident. It is all so powerfully cumulative and integrated that when you 
tumble to one point a lot of others fall into line. Iam now so thoroughly accustomed 
to the idea that it no longer startles me. For a while I resisted the notion. Now I can 
no longer do so. (Golla 1984: 374) 


For reasons that are not entirely clear, Sapir never published any evi- 
dence for this connection. It seems likely that in the wake of Michelson's 
hostile critique of Algonquian-Ritwan (see section 6.3.1) and Pliny God- 
dard's equally negative review of Na-Dene (see section 6.2.1), Sapir decided 
that even more remote genetic connections than these would simply not be 
considered seriously by other linguists and he effectively ceased taxonomic 
work after around 1925. Sapir's unpublished Na-Dene-Sino-Tibetan com- 
parative notebooks are now stored in the American Philosophical Society 
library in Philadelphia, and Americanists view the whole idea of a genetic 
connection between Na-Dene and Sino-Tibetan as yet another of Sapir's 
failed long-shots. According to Campbell (1988: 593), "Needless to say, no 
specialist today embraces this claim." 

Ironically, one of the major discoveries in linguistic taxonomy during the 
1980s has been the elaboration of a Dene-Caucasian family, in which Na- 
Dene and Sino-Tibetan are two components. This line of research began in 
the Soviet Union and for a while escaped the attention of American schol- 
ars. The initial step was Sergei Starostin's (1984) comparison of (North) 
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Caucasian, Sino-Tibetan, and Yeniseian (whose sole surviving language is 
Ket), in which he showed that these three families formed a higher-level 
family that he called Sino-Caucasian. The following year V. A. Chirikba 
presented evidence connecting Basque with (North) Caucasian, an idea 
that had already been suggested by Alfredo Trombetti in 1926. A year later 
Sergei Nikolaev (1986), in a comparison of (North) Caucasian and Na- 
Dene, showed that these two families too were unmistakably related to 
each other, and the overall family was renamed Dene-Caucasian to indicate 
this fact. Sapir's idea of a Sino-Tibetan-Na-Dene connection was thus res- 
urrected by Soviet scholars, though without ever comparing the two fami- 
lies that had led Sapir to his conclusions. 

The Soviet work on Dene-Caucasian became better known in this coun- 
try at the Rice University symposium on the genetic classification of lan- 
guages in March, 1986 (Shevoroshkin and Manaster-Ramer 1991). Since 
that time John: Bengtson (1991a) has provided substantial additional sup- 
port for Dene-Caucasian by making the first multilateral comparison of the 
family, which for Bengtson includes Burushaski and Sumerian, as well as 
the families compared by the Soviets. Furthermore, Bengtson (1991b) pro- 
poses that Basque, (North) Caucasian, and Burushaski form a subfamily, 
which he calls Macro-Caucasian, within the Dene-Caucasian phylum. (It 
should be noted that Claude Boisson [1989] has made an exhaustive com- 
parison of Sumerian with both Nostratic and Dene-Caucasian, and he 
leans toward a stronger connection with the former than with the latter.) I 
have compared Starostin's (1982) Proto-Yeniseian reconstructions with Na- 
Dene etymologies (Ruhlen 1989c) and with Nahali, providing additional 
connecting links for Dene-Caucasian (Ruhlen 1989d). (Heinz-Jürgen Pin- 
now, however, considers Nahali a member of Austroasiatic.) In light of the 
various proposals mentioned above I would now incline toward including 
the following families in Dene-Caucasian: Basque, (North) Caucasian, Bu- 
rushaski, Sumerian, Nahali, Sino-Tibetan, Yeniseian, and Ма-Пепе. The 
inclusion of Nahali and Sumerian seems to me the least secure. 


MONOGENESIS 


If we accept both Greenberg's Eurasiatic proposal and the Soviets' Dene- 
Caucasian phylum, then the number of distinct families is reduced to about 
a dozen, as shown in List 9.2, and a logical next step would be to compare 
these families with each other to see whether further consolidation is pos- 
sible. Such research has been practically forbidden in this century by the 
myth that Indo-Hittite has no known relatives and the related myth that the 
comparative method in linguistics is limited to a time depth of 5,000— 10,000 
years (Ruhlen 1988a, 1990b), conveniently the assumed age of Indo-Hittite. 
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List 9.2. Тһе World's Language Families (1991) 


IKHOISAN ҮШ AFRO-ASIATIC 
H NIGER-KORDOFANIAN: ІХ ELAMO-DRAVIDIAN 

A KORDOFANIAN X KARTVELIAN 
B NIGER-CONGO XI EURASIATIC: 

III NILO-SAHARAN A INDO-HITTITE 

IV AUSTRALIAN B URALIC-YUKAGHIR 

V INDO-PACIFIC C ALTAIC 

VI AUSTRIC: D KOREAN-JAPANESE-AINU 
A AUSTROASIATIC E Gilyak 
B MIAO-YAO F CHUKCHI-KAMCHATKAN 
C DAIC G ESKIMO-ALEUT 
D AUSTRONESIAN ХП AMERIND 

VII DENE-CAUCASIAN: 

A. Basque 
B (NORTH) CAUCASIAN 
C Burushaski 
D Nahali 
E SINO-TIBETAN 
F YENISEIAN 
G NA-DENE 


With these myths now demolished by the work of the Soviet Nostraticists 
and the impending volumes by Greenberg (to appear) on Eurasiatic and 
Dolgopolsky (to appear) on Nostratic, a number of linguists have once 
again turned to the perennial problem of the interrelationships of all the 
world's language families. John Bengtson and I (1988), employing Green- 
berg's multilateral approach, have proposed some two dozen etymologies 
connecting all of the world's language families, and many more are known. 
А careful comparison of all the world's language families—until recently 
almost an illicit enterprise (Matisoff 1990)—will uncover many additional 
connections, and within a few years, I believe, the fact that all extant hu- 
man languages share a common origin will be as casually accepted as is the 
opposite opinion today. Now that the dike of Indo-Hittite isolation has fi- 
nally burst, it is only a matter of time before the resulting wave spreads to 
all of the world's language families. Independent of the work of Bengtson 
and Ruhlen (1988), Soviet linguists have been approaching this same prob- 
lem from their own Nostratic perspective of comparing reconstructed fami- 
lies, but their conclusions are not substantially different from ours. Staros- 
tin (1989a) has shown many convincing connections between Nostratic and 
Dene-Caucasian, and Ша Peyros (1989) has adduced evidence connecting 
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these two families with Austric. Shevoroshkin (1989а) presented evidence 
connecting Nostratic, Dene-Caucasian, Amerind, Australian, and Indo- 
Pacific, and Kaiser and Shevoroshkin (1988) noted similarities between No- 
stratic and the two African phyla, Nilo-Saharan and Niger-Kordofanian. 
The Czech linguist Vaclav Blazek (1989) has also made important contribu- 
tions to global comparisons. Both approaches thus arrive at essentially the 
same conclusion, namely, that all of the world's language families—and 
hence all of the world's extant languages—share а common origin. This 
opinion is admittedly at variance with the prevailing views of the great ma- 
jority of linguists, just as the Amerind family is discounted by the majority 
of American Indian specialists, but I believe both hypotheses are correct. 
Furthermore, they are not in my opinion particularly difficult taxonomic 
questions, and the failure to recognize both taxa is due more to a lack of 
understanding of basic taxonomic principles than to the alleged paucity 
of evidence. The really difficult —and to my mind most interesting—taxo- 
nomic problems involve determining the degrees of relationship among the 
world's language families, not the fact of relationship itself. 

It is important here to emphasize once again, as I did in section 7.3, that 
the fact that all extant languages appear to form a single family does not 
alone imply the correctness of monogenesis. We only have information on 
the languages spoken by modern humans, Homo sapiens sapiens. We have 
no direct evidence on the communicative systems used by earlier homi- 
nids, whether contemporaneous with modern humans, such as the Nean- 
derthals, or much earlier, such as Homo erectus or Australopithecus afarensis. 
How then did language develop? There are basically two poles to this ques- 
tion, though one can conceive of numerous intermediate possibilities. At 
one pole is the possibility that all hominids prior to Homo sapiens sapiens 
were speechles~ so to speak, and that language developed solely—and 
probably rapidly, like a child learning a language—in modern humans, 
leading perhaps to the extinction of all other hominids. At the other pole is 
the idea that linguistic abilities in hominids developed gradually over a 
very long time period so that earlier hominids would have had some kind 
of language which over several million years developed in modern humans 
into what we call human language. I personally find the idea that language 
could have evolved ex nihilo to its present complexity in 100,000 years or so 
implausible, but it is hard to back up this belief with any kind of hard evi- 
dence. My view is that all extant languages have developed from a single 
language that was spoken perhaps 100,000 years ago, but that this lan- 
guage was in no sense primitive. It was in all probability a language not 
qualitatively different from the 5,000 languages spoken today. The mind of 
modern humans has not changed since our appearance on the earth, and I 
doubt that our languages have either. Stephen Jay Gould (1988: 18-20) ex- 
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presses this same point of view, though not in the context of language, 
when he writes that 


At some point, modern Homo sapiens split off from an ancestral group and founded 
our own species. They were us from the beginning, are us now, and shall be us 
until we blow ourselves up or genetically engineer ourselves out of current exis- 
tence. Homo sapiens . . . is an entity, not a tendency. Once we arose as a species— 
and 100,000 to 250,000 years ago in Africa is our best current assessment of time 
and place— we were probably pretty much ourselves in terms of mental organiza- 
tion. . . . I would reverse our usual perspective. . . . We are stunned by what our 
Ice Age ancestors could do. I think that we should look the other way, onward from 
the origin of our species. Is it not remarkable that all of what we call civilization, all 
of agriculture, of the arts and sciences, of technology, of life in complex cities, could 
be built by the unchanged power resident in the mind of a creature who evolved a 
large brain for reasons obviously unrelated to this future potential? 


And it is not, in my view, likely that this creature accomplished these 
remarkable achievements without a fully developed capacity for language 
from the inception. 


HUMAN GENETICS 


Ironically, the event perhaps destined to have the greatest impact on his- 
torical linguistics was the discovery by human geneticists during the 1980s 
that the structure of the human population based on genes— its basic clus- 
ters and their hierarchical arrangement in a family tree—bears a striking 
resemblance to proposals made by linguists on the basis of human lan- 
guage. This discovery was unexpected, at least among linguists, for two 
reasons. First, it had become virtually an article of faith during the twen- 
tieth century that "there is no deterministic connection between language 
and gene pools or culture" (Campbell 1986: 488). And second, even if there 
was some correlation between the evolution of genes and languages, lin- 
guistic evolution is so much swifter than genetic evolution that the two pro- 
cesses could hardly be expected to result in the same groupings (Bateman 
et al. 1990a: 9—10). Yet this is exactly what L. L. Cavalli-Sforza and his col- 
leagues reported in 1988 in a paper that has already sent shock waves 
through the field of historical linguistics. 

It is of course true that any person, with any genes, can learn any hu- 
man language with native proficiency, provided only that he is raised as a 
child in an environment where the language is spoken. Black Americans 
speak English, an Indo-Hittite language, but genetically they are very simi- 
lar to black Africans, who speak non-Indo-Hittite languages. Thus in this 
case black Americans are genetically like black Africans, but linguistically 
like white Europeans. The reason for this discrepancy between genes and 
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languages is hardly a mystery. As is well known, it was the European slave 
trade which kidnapped blacks from Africa and brought them to the Amer- 
icas to sell. Over time the original African languages were replaced by En- 
glish in the U.S.A., and by French, Spanish, or Portuguese elsewhere in 
the Americas. What Cavalli-Sforza and colleagues found, however, is that 
when one eliminates such anomalies, which have known historical expla- 
nations or prehistorical explanations that can reasonably be inferred, a 
generally consistent genetic tree emerges that resembles remarkably the 
linguistic classification given in this book. As to why there should be such 
a correlation, Cavalli-Sforza and colleagues (1990: 18) offer the following 
answer. 


The central question is why there should be any congruence between genetic and linguistic 
evolution. The main reason is that the two evolutions follow in principle the same 
history, namely, sequence of fissions. Two populations that have separated begin a 
process of differentiation of both genes and languages. These processes need not 
have exactly constant evolutionary rates, but rough proportionality with time is a 
reasonable expectation for both. They should therefore be qualitatively congruent, 
except for later events such as gene flow or language replacement. These may blur 
the genetic and the linguistic picture, but our conclusion is that they do not obscure 
it entirely. 


Cavalli-Sforza's comparison of the genetic tree and the linguistic phyla is 
shown in Fig. 9.2. 

Two groups of linguists in particular have expressed outrage at the pro- 
posed genetic findings. First of all, Indo-Europeanists were dismayed to 
find Indo-Hittite connected to other Eurasian families in a fashion reminis- 
cent of the Nostratic/Eurasiatic proposal. The noted Indo-Europeanist Paul 
Hopper (1989) lashed out against the alleged linguistic-genetic correlations 
by claiming that "linguistic hustlers" (Nostraticists and Greenbergians) 
had "induced . . . responsible human biologists" (Cavalli-Sforza and col- 
leagues) into believing that there is some correlation between the distri- 
bution of genes and the distribution of languages. Hopper refers derisively 
to families such as Niger-Kordofanian, Afro-Asiatic, Nilo-Saharan, and 
Austroasiatic as "broad-based guesses," which are not to be confused with 
“factually established genetic categories" like Indo-Hittite. 

The other group of scholars who were more than dismayed by the ge- 
netic findings were the American Indian specialists, who were in the midst 
of their orgy of anti-Greenberg invective (Ruhlen 1988b). At a time when 
Greenberg’s book was being condemned by one American Indian expert 
after another, Cavalli-Sforza found that, on the basis of their genes, the na- 
tive populations of the Americas fell into the same three groups that Green- 
berg had identified on the basis of language one year before. For scholars 
like Goddard and Campbell, such findings were devastating to their posi- 
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AFRICA / A /======, W.African --------- Niger-Kordofanian ---^ 
/=====================m=s=====s= | N /*-, Bantu (ling.) -----/ |-------- Congo-Saharan 
i | \==. Nilo-Saharan (ling.)------- Nilo-Saharan--------- / 
| \__/ши===шш=шш=ш=шшшзз, San (Bushmen) ------------- Кһо1зап 
| Vu——-—Ámám-x»-. Ethiopian -------- \ 
| /=======, Berber,N.Africa --|-------- Afro-Asiatic -------- V 
| [=] 7 /-. S.W.Asia --------- / | 
| [==] Аша) A=, Iranian ---------- ÀN |-------- Nostratic 
| /=| | \====, European --------- I Indo-European ------- [=o==\ 

l | | We-------. Sardinian -------- | I | 
| CAUCASOID /=====-======| \ /=======, Indian ----------- / | | 
| | | \=ж=ж==жж, Dravidian (ling.) --------- Dravidian ----------- | | 
=== | | \========ш=хышш», Saami ------------4 Uralic -------------- І----| 
| | / /--. Uralic (ling.) ---/ | | 
| | /=1 \==, Mongol ----------- \ | |--- Eurasiatic 
| NORTH EURASIA /=====| NORTHEAST | V /==w==, Tibetan ---------- І--- (1) | | 
| | | ASIA /====| \__/==, Korean ----------- |-------- Altaic ------------ --/----| 
| | | /===| | \==, Japanese --------- ! : | 
| | | і i Vemesm=====, Ainu ------------- ! - i 
| | | | \ /=========, N.Turkic (ling.) -/ š | 
| | \=т======] ARCTIC \__/===жа=, Eskimo -------------------- Eskimo-Aleut 3----| 
| | i \======. Chukchi, ---<--------------- Chukchi-Kamchatkan...:----/ 
| | I — /=====. S.Amerind -------- \ : 
| | | — / Ажяшыж, C.Amerind -------- |== Amerind ........ .%%442 
V=w============== | AMERICA Ñ / (еттжкемеш, N.Amerind -------- / 
| Қәнжажжежжламмжжн, Na-Dene (ling.) ----------- Na-Dene 
| MAINLAND AND /=======, S.Chinese ----------------- Sino-Tibetan 
| INSULAR SOUTHEAST | /===,. Mon-Khmer ----------------- Austroasiatic ------- ÓN 
1 ASIA /======== | Қаш, Thai ---------------------- Daic ---------------- | 
| | |=======, Indonesian --------- N |-------- Austric 
| ii [=======, Malaysian ---------- | | 
SOUTHEAST | | | N=z======, Philippine --------- l Austronesian -------- / 
ASIA \===| PACIFIC А /=======m=======, Polynesian ---------| 
| ISLANDS ÀN Г =========, Micronesian -------- | 
| \===а==е==, Melanesian ---------/---(2) 
МЕН GUINEA, \ /===========x=m=====m==a=, New Guinea ---------------- Indo-Pacific 
AUSTRALIA Ve=========n===========a==, Australian ---------------- Australian 
=== Ka несі [Sa [ses [eR Sarna [Ser | 
0.030 0.024 0.018 0.012 0.006 0.000 Genetic distance 


In this tree, (ling.) indicates populations pooled on the basis of linguistic classification. (1) Tibetans are associated 
genetically with the Northeast Asian cluster, but linguistically with the Sino-Tibetan phylum. (2) Melanesians speak, 
in part, Indo-Pacific languages. Genetic data are not currently avallable for Kartvelian or (North) Caucasian populations. 
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tion and could not ро unchallenged, for if another scientific discipline— 
with its own methods and its own data—should arrive at the same classifi- 
cation as Greenberg, such independent corroboration can hardly be easily 
dismissed. The American Indianists' strategy had been to unanimously 
condemn Greenberg's book, distorting and misrepresenting it in the pro- 
cess, so that other scholars would simply accept their verdict without 
checking the facts themselves. This strategy had worked remarkably well 
for Truman Michelson and Pliny Goddard against Edward Sapir's pro- 
posals around 1920, and it continued to be widely used throughout the 
twentieth century. The danger that the genetic findings posed was that out- 
siders, who were not likely simply to accept the verdict of self-appointed 
experts, were being brought into the debate. Furthermore, these outsiders 
had found exactly what Goddard did not want, Greenberg's three families. 

Goddard's response to these unwelcome developments was a joint ar- 
ticle with six biologists from the Smithsonian Institution (Bateman et al. 
1990a), who began by noting with alarm the "instant fame" of Cavalli- 
Sforza's paper “which received the rare accolade of rapid review in Sci- 
ence, . . . Nature, . . . and Natural History." Bateman et al. claim that 
Cavalli-Sforza’s "main conclusion, that there is ‘considerable parallelism 
between genetic and linguistic evolution’ . . . is undermined by analysis of 
an inadequate database by inappropriate methods" (2). But they are later 
forced to admit (Bateman et al. 1990b: 177) that they are "unfamiliar with 
the data, methods, and models of human population genetics." This raises 
the interesting question of how they knew a database with which they 
were unfamiliar was inadequate. Later they make it clear that the existing 
literature on human genetics is not even worth investigating. We should 
rather start from scratch at the level of the individual human: "Extant homi- 
nids are sufficiently abundant and widespread to allow large-scale demo- 
graphic sampling" (1990a: 13). Just round up a batch of these critters and 
test them. In sum, the article by Bateman et al. is replete with pretentious 
terminology (such as the use of "extant hominids" in place of "people"), an 
amusing figure showing that "languages evolved after the ability to speak" 
(10), a plethora of demonstrably false claims ("Languages lack the wide- 
spread but nonuniversal attributes that characterize major groups of orga- 
nisms” [5] conveniently ignores the Eurasiatic pronominal pattern m-/t- “U 
you’ and the Amerind pattern n-/m- ‘I/you.’), and a virtual absence of any 
evidence that Native American populations do not fall into the three groups 
as claimed by Cavalli-Sforza and Greenberg. Bateman et al.’s final argument, 
that they cannot judge the work because they do not have access to either the 
database or the methodological considerations upon which the article was 
based, seems disingenuous, considering that such information could hardly 
be given in a paper with a strict five-page limitation. However, the work 
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from which this article was extracted, which runs to almost 1,000 pages, is 
now approaching publication (Cavalli-Sforza et al. 1991), and the chapter 
devoted to the Americas, which I have read in manuscript, leaves little doubt 
that if Greenberg's Amerind had not already existed, Cavalli-Sforza and col- 
leagues would have had to invent such a term themselves. Such is the story 
that human genes tell for the Americas. 

It should be noted that Cavalli-Sforza's finding of a high degree of cor- 
relation between linguistic and genetic classifications is but one of many 
such discoveries made for different regions of the world during the 1980s, 
work that is briefly reviewed in Ruhlen (1991a). For Africa, Laurent Ex- 
coffier and his colleagues (1987: 151) found that "genetic differentiation 
clearly parallels the clustering of major linguistic families." For the island of 
Sardinia, Alberto Piazza and colleagues (1988) found that "languages and 
genes have a similar geographical distribution even at a microareal level." 
With regard to Europe, Guido Barbujani and Robert Sokal (1990: 1816) re- 
port that "of the 33 gene-frequency boundaries discovered . . . , 31 are co- 
incident with linguistic boundaries marking contiguous regions of differ- 
ent language families, languages, or dialects." In an investigation of the 
Yanomami Indians of South America, Barbujani, Oden, and Sokal (1989: 
386) concluded that “both at the level of variation among dialect groups as 
well as within such groups the regions of rapid genetic change agree with 
observable linguistic differences. These findings support conclusions from 
other studies on the relation of genetics and language." 

Although the idea that there is a general congruence between languages 
and different ethnic groups fell out of favor in the twentieth century, for a 
variety of reasons, the general correlation was in fact noticed long ago. Ac- 
cording to Trombetti (1905: 55), "Language, it is true, is not a trait that is 
directly inherited like physical structure, but in reality agreement between 
languages and races is the rule, disagreement, the exception." Nor did this 
correlation escape the notice of Charles Darwin (1859: 405): "If we pos- 
sessed a perfect pedigree of mankind, a genealogical arrangement of the 
races of man would afford the best classification of the various languages 
now spoken throughout the world; and if all extinct languages, and all in- 
termediate and slowly changing dialects, had to be included, such an ar- 
rangement would, I think, be the only possible one." 

I believe that we now stand on the threshold of a multi-disciplinary in- 
vestigation into human prehistory that will lead, and is already leading, to 
a more precise understanding of the history of our species than has hith- 
erto seemed possible. The independent evidence of historical linguistics, 
human genetics, and archaeology is beginning to be integrated into a co- 
herent and consistent framework for understanding human evolution over 
the past 100,000 years. This is not to say that all three of these disciplines 
yet agree on all particular points, much less that any of these fields is cor- 
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rect in all of its claims. But what is now known seems very promising. As 
Stephen Jay Gould (1989: 22) puts it, "The reconstruction of the human 
family tree—its branching order, its timing, and its geography— may be 
within our grasp. Since this tree is the basic datum of history, hardly any- 
thing in intellectual life could be more important." In a similar vein the Brit- 
ish archaeologist Colin Renfrew (1991a) has recently written that "in a 
couple of decades, I predict, we shall see emerging a new synthesis be- 
tween historical linguistics, prehistoric archaeology, and molecular ge- 
netics.” I would go even further. I believe we can see it emerging even 
today. 


BIBLIOGRAPHIC UPDATE 


Since this book went to press some five years ago I have become aware of 
some sources that were inadvertently omitted, while other works have ei- 
ther come to my attention or were published during this time. 


General 


There are a number of good introductions to historical and comparative 
linguistics, the most recent being Bynon (1977), Hock (1986), and Anttila 
(1989). АП of these works, however, emphasize historical rather than com- 
parative aspects and devote most of their attention to the Indo-Hittite fam- 
ily; for these reasons they are largely complementary to this book. Fleming 
(1987b) offers a world-wide survey of linguistic classification. For the im- 
portance of statistics in historical linguistics, see Embleton (1986). On the 
emerging synthesis between genetics, archaeology, and linguistics, see 
Greenberg, Turner, and Zegura (1986), Cavalli-Sforza et al. (1988, 1991), 
Shevoroshkin (1989b, 1990, 1991), Turner (1989), and Renfrew (1991a, b). 
Within the linguistic community there has developed in recent years a reac- 
tion against the widespread belief that the period of linguistic taxonomy is 
basically over, owing to the alleged limitations of the comparative method. 
A newsletter, Mother Tongue, was founded by Harold Fleming in 1986 spe- 
cifically to confront these erroneous and unjustified beliefs. An especially 
good introduction to the problems of fossil evidence for hominid pre- 
history is given in Lewin (1987). 


Khoisan 


Additional etymologies connecting Sandawe and Hadza to Khoisan are 
given in Fleming (1987a). General works on the family may be found in 
Vossen and Keuthmann (1986). Also of interest is Traill (1986). 
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Niger-Kordofanian 


For an overview of the family, see Bendor-Samuel (1988). Meeussen 
(1980a, b) provides Bantu reconstructions. 


Nilo-Saharan 


Recent contributions to this perennially neglected family include Bender 
(1989, 1991). 


Australian 


Blake (1988) argues, on the basis of pronouns, that the traditional classi- 
fication of Australian languages is incorrect, and that the family consists of 
two basic branches, Pama-Nyungan and non-Pama-Nyungan. Dixon and 
Blake (1979—) present general surveys of various Australian languages. 


Indo-Pacific 


Foley (1986) presents an overview of Papuan languages. 


Austric 


Edmondson and Solnit (1988) contains a series of studies on comparative 
Daic (Kadai), including an extensive set of Proto-Kam-Sui reconstructions 
by Graham Thurgood. Benedict (1990) argues that Japanese is a member of 
Austro-Tai, closest in fact to Austronesian. Such a classification disagrees, 
however, with those of Greenberg (to appear), who places it in Eurasiatic, 
and Starostin (1991), who considers it a member of Altaic within Nostratic. 

Additional Austronesian etymologies have been proposed in Blust (1980, 
1983—84, 1985), and Blust's definitive Austronesian etymological dictionary 
is eagerly awaited. 


Dene-Caucasian 


The basic works defining Dene-Caucasian are Starostin (1984), Nikolaev 
(1986), and Bengtson (1991a, b), all of which are contained in Shevoroshkin 
(1991). Ruhlen (1989d) supplies some connecting links between Yeniseian 
and other families, particularly Na-Dene. 

Klimov (1986) provides an overview of the (North) Caucasian languages, 
as does Catford (1991). Until Nikolaev and Starostin's etymological dictio- 
nary of (North) Caucasian is published, the list of Caucasian roots given in 
Nikolaev and Starostin (1986) will have to suffice. The extinct Hurrian, 
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and the closely related Urartian, languages have recently been shown by 
Diakonov and Starostin (1986) to belong to the Eastern branch of the Cau- 
casian family. Ivanov (1985) argues that the extinct language Hatti belongs 
to the Western branch of (North) Caucasian. 

Proto-Yeniseian reconstructions, and the etymologies upon which these 
are based, are given in Starostin (1982). An English translation of the recon- 
structions is given in Ruhlen (19894). 

For Sino-Tibetan, Starostin (1989b) presents a new reconstruction of Old 
Chinese phonology. Ramsey (1987) offers an overview of the languages of 
China, including the many non-Chinese languages of the south. Sagart 
(1990) suggests that Chinese is more closely related to Austronesian than to 
Tibeto-Burman. 

А number of recent works by Pinnow (1984, 1985a, b, 1986a, b, 1988, 
1990) continue his many contributions to the Na-Dene family. Pinnow, it 
should be stressed, has been one of the staunchest supporters of the Na- 
Dene affiliation of Haida, a "pseudo-controversy" that now appears over. 
Ruhlen (1989c) proposes additional Na-Dene etymologies based on Green- 
berg's unpublished Na-Dene notebook. 


Afro-Asiatic 


Diakonov (1988) and Perrot (1988) offer overviews of the family, while 
Ehret (1989) gives some 800 reconstructed roots. Stolbova (1987) presents 
an extensive list of West Chadic reconstructions. 


Kartvelian 


General surveys of the family are given in Klimov (1986) and in Catford 
(1991). 


Eurasiatic 


The Eurasiatic family is defined, with supporting evidence, in Green- 
berg (to appear). Dolgopolsky (to appear) presents his version of Nostratic, 
whereas Bomhard (1990) offers a somewhat different version of Nostratic 
based on the glottalic theory of Indo-Hittite consonantism. An English 
translation of Illich-Svitych's (1967) Nostratic reconstructions is given in 
Kaiser (1989). 

In addition to the standard Indo-Hittite etymological dictionary by Po- 
korny (1959), we now have Mann (1984—87) and Décsy (1991). During the 
past several years the homeland and spread of the Indo-Hittite languages 
has once again become a focus of research in both archaeology and lin- 
guistics. Gamkrelidze and Ivanov (1984, 1990), Renfrew (1987, 1989), and 
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Dolgopolsky (1988) all agree in locating the Indo-Hittite homeland in Ana- 
tolia (modern Turkey), an idea that fits well with Sturtevant's Indo-Hittite 
hypothesis. Renfrew attempts to connect the spread of Indo-Hittite with 
the spread of agriculture from the Near East around 8,000 years ago, a date 
some two millennia older than the commonly accepted one. Mallory (1989) 
presents a view of the origin of the Indo-Hittite family more in line with the 
traditional views of Marija Gimbutas. 

For Uralic, Rédei (1986—88) has replaced earlier works as the definitive 
etymological dictionary of the family; Décsy (1990) also presents a list of 
Uralic roots, but not those restricted to Finno-Ugric, for which the work by 
Rédei must be consulted. The standard etymological dictionary for the 
Samoyed family is Janhunen (1977). 

For Altaic, Starostin (1991) presents a reconsideration of this family, 
which for Starostin includes Korean and Japanese, in addition to Turkic, 
Mongolian, and Tungus. Shibatani (1990) surveys both Ainu and Japanese. 

Mudrak (1986) offers а list of Chukchi-Kamchatkan reconstructions, and 
Mudrak (1989) gives a list of Eskimo-Aleut reconstructions. 


Amerind 


Regna Darnell's (1990) biography of Sapir provides interesting insights 
into баріг5 taxonomic work, as does Victor Golla's (1984) edition of the 
Sapir-Kroeber correspondence. A compendium of Algonquian reconstruc- 
tions is given in Aubin (1975), and Lincoln and Rath (1980) provide a list of 
North Wakashan roots. Berman (1990) offers additional Algic etymologies. 
Kinkade (1988) presents a thorough examination of Proto-Salish color 
terminology, and Galloway (1988) offers some Proto-Central Salish re- 
constructions. DeLancey, Genetti, and Rood (1988) suggest additional Pe- 
nutian etymologies, and Callaghan (1988) presents a list of Proto-Miwok- 
Costanoan cognate sets. Rensch (1989) offers an etymological dictionary of 
the Chinantec languages, considered a single (highly diversified) language 
in this book. Kaufman (1992) argues that Oto-Manguean is closest to Hokan, 
the two considered by Greenberg (1987) to be members of Central Amerind 
and Northern Amerind, respectively. 

For South America, several general surveys have recently appeared, in- 
cluding Klein and Stark (1985), Payne (1990), and Derbyshire and Pullum 
(1986, 1990, 1991). 
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Strahlenberg, P. J., 34, 66f, 
128f 
Street, J. C., 130ff 
Sturdevant, E. H., 56f 
Subrahmanyam, P. 5., 139 
Swadesh, M., 216, 225, 
234f, 260 
Swanton, J., 197, 213, 228 


Tacitus, 65 

Teeter, K. V., 231 
Tegnér, E., 51 

Thelwall, R., 112 
Thomas, J. M. C., 102 
Thomas, K. H., 175 
Thompson, E., 197 
Thompson, L. C., 232 
Thomsen, V., 49, 51, 259 
Threlkeld, L. E., 186 
Townsend, P. and W., 179 
Trager, G. L., 213 
Trombetti, A., 73 


Trubetzkoi, N. 5., 73 
Tucker, A. N., 110, 112 
Turner, W., 197, 228 


Uhle, M., 228 
Underhill, R., 133 
Uslar, P. K., 72 


Vater, J. S., 197, 228 

Vedder, H., 116 

van der Veen, H., 175 

Verner, K., 49, 50-51 

Vidal, O. E., 97 

Vintilá-Rádulescu, I., 72 

Vladimirtsov, B. J., 132 

Voegelin, C. F. and F. M., 
56, 68%, 73, 133, 145, 148, 
155f, 177, 187, 216-17, 
231, 237f 

Vogel, M., 65 

Vogt, H., 72f 

Voorhoeve, C., 177, 179 


Wallace, A. R., 25, 269 
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Walsh, M. J., 188f 

Waltz, N. E., 239 

Wang, W. S.-Y., 267 

Watuseke, F. 5., 179 

Welmers, W. E., 81, 99, 101, 
110, 121—22, 147, 256 

Westermann, D., 79-82 
passim, 97f, 102, 110f 

Westphal, E. O. J., 117-18, 
119 

Wheeler, A., 239 

Whorf, B., 213, 223, 237 

Williamson, K., 99, 104 

Winckler, H., 55 

Wulff, K., 122 

Wurm, 5. A., 168f, 177-87 
passim, 260 


Zeisberger, D., 205, 228 
Zeuss, ). K., 44 
Z'graggen, J., 177, 179 
Zide, N., 156 

Zorc, R. D., 168 
Zvelebil, K. V., 139 


Language Group Іпйех 


This is ап index to the more important language families that appear in the complete 
classification (section 8.6). It does not list every genetic group in the classification. 
In particular, several kinds of groups have been systematically eliminated, as fol- 
lows. (1) Most groups consisting of a single language, such as KABYLE and BERTA 
(but a few exceptions to this general rule remain, e.g. ALBANIAN and ALEUT). 
(2) Many geographical (or other generic) subdivisions of a more general group, 
such as EAST CHADIC, MIDDLE PAMAN, and ANGAN PROPER, which should 
be sought under CHADIC, РАМАМ, апа ANGAN, respectively. In cases where 
what appears to be a subdivision (e.g. WEST ATLANTIC) is in fact an independent 
entity (there is no EAST ATLANTIC or NORTH ATLANTIC), the group is of course 
retained and is listed twice (in this case under WEST ATLANTIC and ATLANTIC, 
WEST). (3) All groups with names such as "GROUP A" or "GROUP 1" have been 
omitted. 

The rules of alphabetization of group names follows the same principles outlined 
in the introduction to the Language Index. As in the Language Index homonymous 
names are distinguished by specifying a higher-level grouping after each: 


SEPIK (SEPIK-MADANG) 
SEPIK (SEPIK-RAMU) 


ABXAZ-ABAZA, 324 

ACHINESE-CHAMIC, 344 

ACROA, 377 

ACUA, 377 

ADAMAWA, 102, 278, 305 

ADAMAWA-UBANGIAN, 
82Н, 95-102 passim, 


30577 
ADELBERT RANGE, 182, 
357 


ADI-NISHI, 331 

ADMIRALTY ISLANDS, 
167, 349—50 

ADZERA, 348 

AFAR-SAHO, 322 

AFRICA, SOUTHERN, 84, 
117, 301 

AFRO-ASIATIC, 76, 81-94 
passim, 253, 258, 279, 
286, 320—23 


АТАМ, 360 
AKA-KELO-MOLO, 317 
AKHA, 333 
ALABAMA-KOASATI, 368 
ALAMBLAK, 360 
ALASKAN YUPIK, 329 
ALBANIAN, 56, 56f, 286, 
326 
ALEUT, 193--94, 217, 329 
ALGIC, 228-31, 288, 366 
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ALGONQUIAN, 205-22 
passim, 228— 31, 288, 366 
ALMORA, 331 
ALMOSAN, 228, 230, 366 
ALMOSAN-KERESIOUAN, 
228, 230, 366—67 
ALOR, 358 
ALOR-PANTAR, 359 
ALTAI, 328 
ALTAIC, 125, 127-33, 258f, 
280, 286, 328—29 
ALUR-ACHOLI, 318 
AMADI, 102, 307 
AMALUMUTLE, 345 
AMAZONIAN, 374 
AMERIND, 200-242, 253, 
258, 260f, 288, 366-77 
AMERIND, CENTRAL, 
228, 230, 370—71 
AMERIND, МОКТНЕКМ, 
220, 230, 366-69 
АМНАКІС-АКСОВВА, 323 
AMTO-MUSIAN, 178f, 182, 


359 
ANATOLIAN, 35—36, 56ff, 
125, 286, 325 
ANDAMAN ISLANDS, 


175f, 181, 183, 354 
ANDAMANESE, GREAT, 


354 
ANDAMANESE, SOUTH, 


354 
ANDEAN, 220, 228ff, 238f, 
278, 289, 371-72 
ANDI, 74, 324 
ANGAL-KEWA, 354 
ANGAMI, 332 
ANGAN, 178f, 182, 355 
ANGAS, 321 
ANGKUIC, 335 
ANNABERG, 360 
ANTIOQUIA, 238, 371 
АО, 332 
АРАСНЕ, WESTERN, 
-NAVAJO, 199,365 
APACHEAN, 197, 199, 228, 
365 
APO DUAT, 342 
ARABIC, 77, 92, 323 
ARABO-CANAANITE, 92, 


323 
ARAFUNDI, 360 
ARAI, 178, 182 
ARAMAIC, 77, 323 
ARANDIC, 189, 364 
ARAONA, 376 


ARAONA-TACANA, 376 
ARAPESH, 359 
АКАУУАКАМ, 208, 220, 225, 
228, 240, 373-74 
ARAWAN, 240, 373 
АКАУҒЕ, 349 
ARIKEM, 373 
ARMENIAN, 56f, 125, 286, 


325 

ARTUYA, 364 

ARU, 346 

ARUAK, 238, 371 

ARUTANI-SAPE, 239, 372 

ASLIAN, 151, 155f, 337 

ASMAT-KAMORO, 355 

ASTROLABAN, 347 

ATA-TIGWA, 340 

ATAITAN, 361 

ATAN, 357 

ATAYALIC, 165ff, 338 

ATHABASKAN, 195—200 
passim, 208—17 passim, 
221, 223, 228, 257, 278, 
365 

ATHABASKAN-EYAK, 195, 
197-99, 217, 365 

ATLANTIC, WEST, 85f, o7f, 
100, 303 

ATLAS, 93, 320 

AUSTRALIAN, 159, 184— 
89, 253—60 passim, 
362—64 

AUSTRIC, 126, 148-56, 
159-68, 258, 260, 287, 


334-53 
AUSTRO-TAI, 126, 1524, 
260f, 337-53 
AUSTROASIATIC, 125f, 
144, 148-56, 260ї, 287, 


334-37 
AUSTRONESIAN, 144, 148, 
151—54, 159—68, 173, 288, 


338-53 
AVARO-ANDI-DIDO, 74, 

324 
AWIN-PA, 179, 355 
AWJILA-SOKNA, 320 
AWYU, 175, 355 
AWYU-DUMUT, 175, 355 
AYMARAN, 239, 372 
AZERBAIJANI, 328 
AZTEC PROPER, 214, 237, 


370 

AZTECAN, 205, 209, 213, 
217—23 passim, 228f, 
236f, 370 
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BABINE-CARRIER, 365 

BACTRIAN, 325 

BADE, 321 

BADE-WARJL 321-22 

BAFIA, 312 

BAGIRMI, 318 

BAHINEMO, 360 

BAHING, 146, 332 

BAHNARIC, 151, 155f, 
335-36 

BAI, 145f, 331 

BAI-VIRI, 306 

BAIBAI, 359 

BAINING-TAULIL, 361 

BAJAW, BORNEO COAST, 
344 

ВАК, 101, 303 

BAKA-GUNDI, 307 

BALANT-GANJA, 303 

BALI-SASAK, 344 

BALI-VITU, 349 

BALTIC, 37, 42, 46, 54-60 
passim, 327 

BALTIC FINNIC, 68f, 328 

BALTO-SLAVIC, 37, 58, 287, 


327 
BALUCHI, 326 
BAMBUKA, 102, 306 
BAMILEKE, 311 
BANDA (CENTRAL MA- 
LUKU), 102, 307 
BANDA (UBANGIAN), 345 
BANE, 103, 311-12 
BANGI-NTOMBA, 313 
BANIVA-YAVITERO, 374 
BANTOID, 103, 311-16 
BANTOID, NON-BANTU, 
103, 311 
BANTON, 341 
BANTU, BROAD, 81, 83, 
253, 255, 257, 311-16 
BANTU, CENTRAL, 81, 83, 
253, 255, 257, 313-16 
BANTU, GRASSFIELDS, 
81, 83, 253, 255, 257, 
311-12 
BANTU, NARROW, 81, 83, 
253, 255, 257, 312-16 
BANTU, NIGERIAN, 81, 
83, 95, 253, 255, 257, 311 
BANTU, NORTHWEST, 81, 
83, 253, 255, 257, 312-13 
BARA, 372 
BARAIC, 356 
BARAM-TINJAR, 343 
BARAMBO-PAMBIA, 307 
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BARASANO, 372 
BARBACOAN, 238, 371 
BARI, 109f, 318 
BARIAI, 348 
BARIAI-NGERO, 348 
ВАКІС, 144, 147, 332 
ВАКІТО, 343 
BARKLY, WEST, 188, 362 
BASAA, 312 
BASHIIC, 339 
BASHIIC-CENTRAL 
LUZON-NORTHERN 
MINDORO, 339 
BASSA (CARA), 310 
BASSA (KRU), 304 
BATA, 321 
BATAK, 344 
BAULE-AGNI, 308 
BE-KAM-TAI, 337-38 
BEAVER-SEKANI, 199, 365 
BEBOID, 312 
BEL, 547 
BELLA COOLA, 366 
ВЕМВА, 315 
BEMBE, 313 
ВЕМА-КІМСА, 314 
BENDI, 103, 310 
BENDI-BOKYI, 310 
BENGALI-ASSAMESE, 
325 
BENGUET, 339 
BENUE-ZAMBESI, 100, 
105, 309— 16 
BERBER, 78-93 passim, 
286, 320 
BEWANI, 358 
BEZHTA-HUNZIB, 324 
BIAKIC, 346 
BIHARI, 38, 325 
BIKOL, 161, 257, 340-41 
BILIC, 341 
BILIN-KAMTA, 322 
BIMA-SUMBA, 346 
BINANDEREAN, 178, 182, 
356 
BIRA-HUKU, 313 
BIRD'S HEAD, 178, 359 
BIRD'S HEAD, EAST, 178, 
181, 359 
BIRD'S HEAD, SOUTH, 
178, 1811, 358 
ВІКОМ, 310 
BIROM-MIGILI, 310 
BISA-LAMBA, 315 
BISAYAN(CENTRAL 
PHILIPPINES), 341 


BISAYAN (DUSUN BISAYA), 
342 
BIU-MANDARA, 92, 
320—21 
BLAAN, 341 
BOAZI, 355 
BODIC, 144, 331 
BODO, 146, 332 
BODO-GARO, 332 
BOGHOM, 322 
BOLE, 321 
BOLE-ANGAS, 321 
BOLE-TANGALE, 321 
BOLGAR, 133, 328 
BOLIVIAN, 374 
BOMBERAI, 346 
BOMBERAI, WEST, 178, 
1814, 356 
ВОМСО, 318 
BONGO-BAGIRMI, 109, 
112, 318 
BONTAWA, 332 
BONTOK, 339 
BONTOK-KANKANAY, 339 
BORA-WITOTO, 240, 375 
BORAI-HATTAM, 178, 359 
BORAN, 375 
BORDER, 358 
BORNEO, 165, 167f, 
342-43 
ВОКМЕО, NORTH, 344 
BORORO, 241, 376 
BOSAVI, 555 
BOUGAINVILLE (EAST PA- 
PUAN), 176, 178, 361 
BOUGAINVILLE 
(OCEANIC), 167, 350 
BRAHMAN, 357 
BRAO-KRAVET, 336 
BRAZIL, NORTHERN, 375 
BRINKEN, 362 
BRINKEN-WAGAYDY, 362 
BRYTHONIC, 59, 327 
BUA, 102, 306 
BUANG, 348 
BUBI-BENGA, 312 
BUHI-DARAGA, 340 
BUHID-TAUBUID, 340 
BUIN, 361 
BUKA, 350 
BULAKA RIVER, 175, 358 
BULANG, 335 
BULI-KOMA, 305 
BULLOM, 304 
BUNABAN, 189, 362 
BUNGANDIDJ, 363 
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BUNGKU-MORI, 342 
BURA, 320-21 
ВОКА-НІСІ, 320—21 
BURA-MANDARA, 320-21 
ВОКАККАМ, 189, 362 
BURMESE-LOLO, 1456, 333 
BURMESE-MOSO, 146, 333 
BURMIC (BURMESE- 
LOLO), 146, 333 
BURMIC (TIBETO- 
BURMAN), 144, 146, 
332— 
BURU-TALIABO, 345 
BUSHONG, 313 
BUSU, 348 
BUTUAN-TAUSUG, 341 
BWAIDOKA, 349 
BWAMU-OTI-VOLTA, 305 
BWE, 331 


CADDOAN, 208-18 pas- 
sim, 228-32 passim, 288, 
367 

CAHITA, 237, 370 

CAHUAPANAN, 371 

CAHUAPANAN- 
ZAPAROAN, 239, 371 

CAHUILLA-CUPENO, 237, 


370 
CALABAR RIVER, 309 
CALIFORNIA (ATHABAS- 
KAN), 199, 365 
CALIFORNIA (PENU- 
TIAN), 235, 368 
CAMPA, 374 
CANAANITE, 92, 323 
CANADIAN (ATHABAS- 
КАМ), 199, 365 
CANADIAN (PENUTIAN), 
235, 367 
CANDOSHI, 373 
CANGIN, 101, 303 
CAPE GLOUCESTER, 348 
CAQUETION, 374 
CARA, 105, 309—10 
CARIB, 208, 220, 238ff, 375 
CARRIER-CHILCOTIN, 
199, 365 
CASPIAN, 326 
CATACAO, 239, 371 
CATAWBA, 367 
CATUQUINAN, 239, 372 
CAUCASIAN, 24, 74-75, 
128, 258-61, 286, 324 
CAVINENA-TACANA, 376 
CAYAPA-COLORADO, 371 
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CEBUAN, 341 
CELEBES, 165, 167f, 342 
CELTIC, 37, 42, 46,54-58, 
257, 287, 327 
CENTRAL PROVINCE, 349 
CHADIC, 83—92 passim, 
162, 278, 286, 320—22 
CHAGA, 314 
CHAM-CHRU, 344 
CHAM-MONA, 305 
CHAMBA, 102, 306 
СНАМІС, 126, 150f, 155, 
166, 344 
CHANG-TANGSA, 332 
CHAPACURA, 373 
CHAPACURAN, 240, 373 
CHAPLINO-NAUKAN, 


329 
CHARRUAN, 241, 375 
CHAWAI, 309 
CHECHEN-INGUSH, 324 
CHIBCHAN, 208, 220, 228, 


238, 371 
СНІВСНАМ РКОРЕЕ, 238, 


371 
CHIBCHAN-PAEZAN, 208, 
228— 30, 238, 289, 371 
CHIBCHAN, NUCLEAR, 
238, 371 
CHILCOTIN, 365 
CHIMAKUAN, 208-18 pas- 
sim, 222, 2258-32, 366 
CHIMBU, 355 
CHINANTECAN, 208, 228, 
237f, 371 
CHINBOK, 333 
CHINESE, 142--46 passim, 
257, 331 
CHIPAYAN, 217, 240, 373 
CHIQUITO, 241, 376 
CHIRIGUA, 376 
CHITRAL, 325 
CHIWERE-WINNEBAGO, 
233, 367 
CHOCHO-POPOLCC, 371 
CHOCO, 208, 238, 371 
CHOISEUL, 167, 350 
CHOKWE-LUCHAZI, 515 
CHOLONA, 239, 371 
CHONG, 336 
CHOPI, 316 
CHUJEAN, 368 
CHUKCHI- 
KAMCHATKAN, 126, 
128, 133-36, 217, 257ff, 
280, 287, 329 


CHULYM-SHOR, 328 
CIRCASSIAN, 324 
COAHUILTECAN, 208—17 
passim, 228, 369 
COCONUCAN, 371 
COLOMBIA, SOUTH- 
EASTERN, 375 
CORACHOL, 209, 228, 
236f, 370 
CORDILLERAN, CEN- 
TRAL, 338-39 
CORDILLERAN, NORTH- 
ERN, 338 
CORDILLERAN, NU- 
CLEAR, 339 
CORDILLERAN, SOUTH- 
CENTRAL, 168, 338-39 
CORDILLERAN, SOUTH- 
ERN, 339 
COSTANOAN, 208--11, 
218, 229, 368 
COTABATO, 340 
CROSS, LOWER, 103, 309 
CROSS, UPPER, 103, 309 
CUA-KAYONG, 336 
CUPAN, 237, 370 
CUSHITIC, 78—84 passim, 
87-91, 278, 286, 322-25 
CUSHITIC, CENTRAL, 84, 
89f, 322 
CUSHITIC, EASTERN, 84, 
89f, 322-23 
CUSHITIC, SOUTHERN, 
84, 89f, 525 
CUTINANA, 374 


DABA, 321 

DAGAN, 178, 182, 356 

DAGESTAN, 74, 324 

DAIC, 126, 148-56, 337-38 

DAJU, 112, 317 

DAKA, 102, 306 

DALY (AUSTRALIAN), 
188f, 362 

DALY (MALAKMALAK), 
362 

DAMA-MONO, 306 

DANAO, 340 

DANGLA, 320 

DANI, 356 

DANI-KWERBA, 178, 182, 
356 

DARDIC, 325 

DAYAK, LAND, 343 

DAYAK, MALAYIC, 344 

DELTA, 308 
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DELTA CROSS, 100, 103, 
309 
DELTA-CALIFORNIAN, 


369 
DELTA, CENTRAL, 309 
DESANO, 372 
DHEGIHA, 233, 367 
DHIMAL-TOTO, 331 
DIDO-HINUX, 324 
DIGARO-MIDU, 331 
DINKA, 109, 318 
DINKA-NUER, 110, 318 
DIOLA, 303 
DIOLA-GUSILAY, 303 
DIZOID, 92f, 322 
DJAMINDJUNGAN, 189, 
362 
DJERAGAN, 189, 362 
DOBU, 349 
DOGHOSE-GAN, 305 
DOGHOSE-GURUNSI, 102, 
395 
DOGON, 100, 304 
DRAVIDIAN, 125, 128, 136- 
41, 150-51, 259f, 330 
DUALA, 312 
DUAU, 348 
DUKA, 310 
DULLAY, 322 
DUMAGAT, 338 
DUMI, 332 
DUMUT, 355 
DUNA-POGAYA, 355 
DURU, 102, 306 
DURUBALIC, 189, 363 
DUSUN, 342 
DUSUN-BISAYA, 342 
DYIRBALIC, 189, 363 
DZORGAIC, 332 


EAST NEW GUINEA 
HIGHLANDS, 354-55 
EASTERN JEBEL, 112, 317 

EBANG-LARU, 302 

EBANG-LOGOL, 302 

EDO, 100, 103, 278, 308—9 

EFIK, 309 

EGGON, 310 

EKAGI-WODANI-MONIL, 
356 

EKOID, 311 

EKOID-MBE, 311 

ELAMITE, 330 

ELAMO-DRAVIDIAN, 141, 
258, 287, 330 
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ЕГЕМАМ, 175, 178, 182, 
358 

ELGON, 318 

EMUAN, 357 

ENGA, 355 

ENYA, 313 

ЕРІ, 352 

EOUATORIAL, 220, 225-30 
passim, 238, 240, 289, 
372-74 

EQUATORIAL- 
TUCANOAN, 230, 
372-74 

ЕКАР, 354 

ЕККОМАМСА, 350 

Е5КІМО, 128, 193, 217, 
287, 329 

ESKIMO-ALEUT, 136, 
193-95, 2084, 214-253 pas- 
sim, 2284, 257--61, 280, 
287, 329 

ЕУАРІА, 356 

EVENKI, 329 

EWE-FON, 308 

EWONDO, 312 


FARS, 326 

FIJIAN, 352 

FINISTERRE, 179, 354 

FINISTERRE-HUON, 178, 
182, 354 

FINNIC, 65, 68—70, 328 

FINNIC, NORTH, 69, 328 

FINNO-UGRIC, 65, 68, 70, 
128f, 278, 328 

FIVE NATIONS, 233, 367 

FLORES-LEMBATA, 346 

FORE, 354 

FUR, 112, 317 

FYAM, 310 


GA-ADANGME, 103, 308 
GADDANG, 338 
GADSUP-AUYANA-AWA, 


354 
GAHUKU, 354 
GALABOID, 323 
GALLO-IBERO- 
КОМАМСЕ, 326 
GALLO-ROMANCE, 326 
GAN-KEJIA, 331 
GARAWAN, 189, 326 
GARO, 332 
GBANZILI, 307 
GBANZILI-SERE, 306-7 
GBAYA, 102, 306 


GBAYA-GBANZILI, 306 

GBI-GOLO, 307 

GE, 208, 220, 228f, 241, 377 

GE-KAINGANG, 241, 
376-77 

СЕ-РАМО, 230, 375-77 

СЕ-РАМО-САКІРВ, 220, 230, 


374 
GEELVINK BAY (INDO- 
PACIFIC), 182, 359 
GEELVINK BAY 
(SOUTH HALMAHERA- 
GEELVINK BAY), 178, 
181, 346-47 
GEELVINK BAY, EAST, 178, 
181, 359 
GELA, 
GELA-GUADALCANAL, 


351 
СЕКМАМІС, 33, 37, 42, 46, 
54—60 passim, 257, 287, 
327 
GIMIRA, 322 
GIMOJAN, 93, 322 
GLAVDA, 321 
GOAM, 361 
GOGO, 314 
GOGODALA, 355 
GOGODALA-SUKI, 178, 
182, 355 
GOIDELIC, 59, 327 
GOILALAN, 178f, 182, 356 
GONDI-KOYA, 330 
GONDI-KUI, 138, 330 
GONGA, 93, 322 
GONGA-GIMOJAN, 92#, 


322 
GORONTALO, 342 
GRASS, 360 
GRASSFIELDS BANTU, 
103, 311 
GREEK, 37, 42, 46, 54—58 
passim, 278, 326 
GTA', 334 
GUADALCANAL, 351 
GUAHIBAN, 240, 373 
GUAICURUAN, 241, 375 
GUAPORE, 373 
GUAYMI, 238, 371 
GUAYUPEAN, 374 
GUBU-YAKPA, 307 
GUERE, 304 
GUIANA, EAST-WEST, 


375 
GUIANA, SOUTHERN, 
375 
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GUIANA, WESTERN, 375 

GULE, 216f, 228, 230, 234f, 
368 

GUM 

GUMBAYNGGIRIC, 189, 

363 

GÜMUZ, 319 

GUNWINYGIC, 362 

GUNWINYGUAN, 188f, 
362 

GUR, 83f, 95-102 passim, 


304-5 
GURAGE, EAST-HARARI, 


323 
GURAGE, NORTH, 323 
GURAGE, WEST, 323 
GURMA, 305 
GURMA-YOM, 305 
GURUNG-THAKALI, 331 
GURUNSI, 305 
GURUNSI-KIRMA, 305 
GURUNTUM, 322 
GUSAP-MOT, 354 
GUTOB-REMO, 334 
GUTOB-REMO-GTA”, 334 


HAI.N//UM, 301 

HAKA, 333 

HALMAHERA, NORTH- 
ERN, 178, 359 

HALMAHERA, SOUTH, 
165, 167, 346-47 

HALMAHERA, SOUTH, 
-GEELVINK BAY, 176, 
346-47 

HALMAHERA, SOUTH, 
-NORTHWEST NEW 
GUINEA, 165, 167, 176, 
346-47 

HAN-KUTCHIN, 199, 365 

HANSEMAN, 357 

НАКАКМВЕТ, 374 

HARE-CHIPEWYAN, 199, 
365 

HARE-SLAVE, 365 

HATUHAHA, 346 

HAUSA, 321 

HAYA-KWAYA, 314 

HEIBAN, 99, 302 

HIGI, 321 

HIMALAYAN, CENTRAL, 
146, 331 

HIMALAYAN, EAST, 146, 


332 
HIMALAYAN, WEST, 146, 
331 
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HOAMOAL, 345 

HOKAN, 209-20 passim, 
226-37 passim, 289, 369 

HOLU, 315 

HOTE, 348 

HSIFAN, 333 

HUARI, 239, 372 

HUARPE, 238, 371 

HUASTECAN, 228, 368 

HUON, 176, 179, 354 

HUON GULE, 348 


IBALOI-KARAW, 339 
IBANAG-ATTA, 338 
IBERO-ROMANCE, 326 
IDOMA, 308 

IFUGAO, 339 

IGBO, 309 

IJO, 83, 95, 100, 103, 257, 


307 
IJO-DEFAKA, 307 
INANWATAN, 358 
INDIA, NORTHERN, 325 
INDIC, 38, 42, 46, 58, 125, 
257, 325 
INDO-EUROPEAN, 24-25, 
42, 46, 54, 56, 204, 253, 
256, 260f, 279, 325 
INDO-HITTITE, 35-63, 
125, 258f, 279f, 286, 
325-27 
INDO-IRANIAN, 38, 46, 
54, 56, 58, 278, 286, 
25—26 
INDO-PACIFIC, 159, 
172—835, 257—631, 288, 
354-61 
INDONESIAN, 151, 163, 
165, 344 
INGALIK-KOYUKON, 199, 
365 
INLAND GULF, 178f, 182, 


357 
INTER-ANDINE, 238, 371 
INUIT, 194, 329 
IRAN, CENTRAL, 326 
IRANIAN, 42, 46, 54, 56, 
58, 125, 325-26 
IROQUOIAN, 208f, 214-20 
passim, 228-33 passim, 
288, 367 
ISUMRUD, 357 
ITALIC, 37, 42, 46, 54—59 
passim, 257, 287, 526 
ITALO-ROMANCE, 326 
IVATAN, 339 


IWAM, 360 
IXCATEC-CHOCHO, 371 
IXILAN, 369 

IYORA, 363 

IZANENI, 374 


JANJERO, 322 

JAPANESE-RYUKYUAN, 
132, 259, 287, 329 

JARAWAN, 105, 311 

JARAWAN-EKOID, 311 

JEBEL, EASTERN, 112, 317 

JEH-HALANG, 336 

JERA, 309 

JIMI, 555 

JIRAJARAN, 238, 371 

JIVARO PROPER, 373 

JIVAROAN, 208, 220, 230, 
240, 575 

JOSEPHSTAAL, 357 

JOSEPHSTAAL-WANANG, 
297 

JUKUN, 309 

JUKUNOID, 100, 103, 278, 


309 
JURUA-PURUS, 374 


KA, 308 
KABENAU, 356 
KACHIN, 146, 333 
KACHIN-LUIC, 146, 333 
KADAIDAXC, 126, 148- 
56, 337 
KADARA, 310 
KADUGLI, 99, 302 
KAILI-PAMONA, 342 
KAINGANG, 241, 376 
KAIRIRU, 347 
KAIWA-SIBOMA, 348 
KAJE, 310 
KAJE-KADARA, 310 
KAKABAI, 348 
KAKO, 312 
KALAM, 354 
KALAM-KOBON, 354 
KALAMIAN, 340 
KALAPUYA, 207, 214, 217, 
367 
KALENJIN, 110, 318 
KALINGA, 339 
KALINGA-ITNEG, 339 
KALKATUNGIC, 364 
KALLAHAN, 339 
KAM-SUI, 1536, 337 
KAM-TAI, 145, 337-38 
КАМАКАМ, 241, 376 
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KAMBARI, 310 
KAMBARI-DUKA, 310 
KAMBATA-HADIYYA, 322 
KAMUKU, 310 
KANAURI, 146, 331 
KANJOBAL-JACALTEC, 


369 
KANJOBALAN, 369 
KANJOBALAN, GREATER, 
368—69 
KANKANAY, 339 
KANNADA, 138ff, 330 
KANURI, 317 
KANYARA, 189, 364 


KARDU, 189, 364 
KAREN, 143-46, 150, 153, 
287, 331 
KARIRI, 373 
KARIRI-TUPI, 240, 373 
KARNA, 364 
KARNIC, 189, 364 
KATLA, 99, 302 
KATU-THAP, 335 
KATUIC, 155f, 335 
KAUKOMBARAN, 357 
KAURE, 178, 182, 358 
KAWAHIB, 373 
KAYAGAR, 178, 182, 356 
KAYAH, 331 
KAYAN, 343 
KAYAN-KENYAH, 343 
KAZUKURU, 361 
KEI-ARU, 346 
КЕГЕ (BANTU, ZONE В), 


313 

KELE (BANTU, ZONE С), 
313 

КЕМҮАН, 343 

KENYAH, MAIN, 343 

KERA, 320 

KERESAN, 216, 230, 288, 
367 

KERESIOUAN, 230, 367 

KHALKHA-BURIAT, 133, 


329 
KHAMBU, 332 
KHAMI, 333 
KHASI, 151, 334 
KHERWARI, 334 
KHMUIC, 155f, 335 
KHOISAN, 76, 814, 84, 114- 
19, 258, 260, 278, 286, 301 
KHWAREZMIAN, 325 
KIKUYU-KAMBA, 314 
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KILIVILA, 349 

KIMBE, 167, 349 

KINARAYAN, 341 

KINGKEL, 363 

KIOWA-TOWA, 214, 236, 
379 

KIRMA-LOBI, 3o5 

KIRMA-TYURAMA, 305 

KITJIC, 362 

KIWAIAN, 175, 358 

KLAU, 304 

KODAGU, 330 

KOIARIAN, 178, 182, 356 

KOIARIC, 356 

KOKON, 357 

KOLAMI-NAIKI, 330 

KOLAMI-PARJI, 139ff, 330 

KOLHRENG, 332 

KOLOPOM, 175, 178, 182, 
358 

KOMAN, 81, 84, 109, 111f, 
319 

KOMBIO, 359 

KOMBIO-ARAPESH, 178, 


359 
KOMO-OPO, 319 
КОМО-ТУУАМРА, 319 
KOMUZ, 112, 319 
KONDA-KUI, 330 
KONDA-YAHADIAN, 358 
KONGO, 314 
KONSOID, 325 
KONZO, 514 
КОКАР, 348 
KORAPUT, 334 
KORDOFANIAN, 81, 84, 
95, 99f, 302 
KOREAN-JAPANESE, 125, 
131f, 259, 329 
KORYAK-ALYUTOR, 136, 
329 
KOTOKO, 321 
KOVE-BARIAI, 348 
KOWAN, 357 
KOYUKON- 
HOLIKACHUK, 365 
KPAN-ICEN, 309 
KRESH, 318 
KRISA, 359 
KRU, 100ff, 304 
KUI-KUVI, 330 
KUIWAI, 346 
KUKI-NAGA, 146, 332-33 
KUKI, CENTRAL, 333 
KUKI, NORTHERN, 332 
KUKI, OLD, 332 


KUKI, SOUTHERN, 333 

KUKI, WESTERN, 332 

KULANGO-LORHON, 102, 
394 

KULIAK, 112, 318 

KULIN, 363 

KULINIC, 189, 363 

KUMILAN, 357 

KUMYK-KARACHAY, 328 

KUNAR, 325 

KURAMA, 309 

KURDISH, 38, 257, 286, 326 

KURI, 363 

KURUMFE-OTI-VOLTA, 
102, 305 

KUSKOKWIM, UPPER, 365 

KUTUBUAN, 178f, 182, 


355 
KUY-SUEI, 335 
KUYAN, 341 
KWALEAN, 178, 182, 356 
KWALHIOQUA, 365 
KWAMA-OPO, 319 
KWANGWA, 315 
KWANKA, 310 
KWEGU, 317 
KWENI-TURA, 303 
KWERBA, 356 
KWOMTARI, 359 
KWOMTARI-BAIBAI, 178f, 
182, 359 


LAGOON, 308 
LAK-DARGWA, 74, 324 
LAKE PLAIN, CENTRAL, 
356 
LAKE PLAIN, EAST, 358 
LAKHER, 333 
LAKKIA-KAM-TAI, 337—38 
LAMALAMIC, 363 
LAMET-KHAMET, 335 
LAMGANG, 332 
LAMOGAI, 349 
LAMPUNG, 344 
LAND DAYAK, 343 
LANGBASI-NGUBU, 307 
LANGO-ACHOLI, 318 
LANOH-SEMNAM, 336 
LAPPIC-»SAAMIC, 68, 328 
LAQUA-LAHA, 337 
LARAGIYAN, 189, 362 
LARU-LOPA, 310 
LATI-GELAO, 154, 337 
LATINO-FALISCAN, 326 
LEGA-KALANGA, 313 
LENJE-TONGA, 315 
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LEONARD SCHULTZE, 
178, 360 
LEZGIAN, 74, 324 
LI-KAM-TAI, 154, 337-38 
LI-LAQUA, 337 
LIBON-DARAGA, 340 
LIGBI-NUMU, 303 
LOBI-DAN, 305 
LOGOLI-KURIA, 314 
LOHORONG, 332 
LOLO, 146, 333 
LOLO, CENTRAL, 333 
LOLO, EAST, 333 
LOLO, NORTHERN, 333 
LOLO, RESIDUAL, 333 
LOLO, SOUTHERN, 333 
LOTUXO-MAA, 318 
LOTUXO-TESO, 110, 318 
LOYALTY ISLANDS, 167, 
351 
LUBA, 315 
LUIC, 333 
LULE-VILELA, 241, 375 
LUNDU-BALONG, 312 
LUO, 109f, 317-18 
LUO-ACHOLI, 317-18 
LURI, 326 
LUSHOOTSEED-TWANA, 
366 
LUWOID, 325 
LUZON, CENTRAL, 339 
LUZON, NORTHERN, 
338—39 


MABAN, 81, 84, 108f, 111, 


317 
MABAN-BURUN, 317 
MABANG, 317 
MABUAN, 357 
MABUSO, 357 
MACRO-ARAWAKAN, 240, 


373-74 
MACRO-CARIB, 220, 
228-30, 238, 240, 289, 
374-75 
MACRO-GE, 220, 228—350, 
238, 241, 289, 376-77 
MACRO-PANOAN, 220, 
228-30, 238, 241, 289, 


375-76 
MACRO-TUCANOAN, 
228-30, 238f, 289, 372 
MADAK, 175, 349 
MADANG (MADANG- 
ADELBERT RANGE), 
178, 182, 356-57 


422 


MADANG (SEPIK- 
MADANG), 347 
MADANG-ADELBERT 
RANGE, 356—57 
MADEIRA, 373 
MAGORI, 349 
MAHAKAM, 343 
MAIDUAN, 208-18, 255, 
368 
MAILUAN, 178, 182, 356 
MAIMAI, 178, 359 | 
MAIN SECTION (of 
TRANS-NEW GUINEA 
phylum), 177-83, 256, 
354-56 
MAIPURAN, 240, 374 
MAIRASI, 356 
MAIRASI-TANAH 
MERAH, 178, 182, 356 
MAJANG-MIKEYIR, 317 
MAKAA-NJEM, 312 
MAKU, 372 
MAKUA, 315 
MAL, 335 
MAL-KHMU', 335 
MALAITA, 351 
MALAITA-SAN 
CRISTOBAL, 351 
MALAKMALAK, 362 
MALAY-MOKLEN, 344 
MALAY, ABORIGINAL, 
128, 152, 344 
MALAY, LOCAL, 344 
MALAY, PARA-, 344 
MALAYAN, 344 
MALAYIC, 344 
MALAYO-POLYNESIAN, 
151, 159, 162, 165, 167, 
338-53 
MALAYO-POLYNESIAN, 
CENTRAL, 167, 345—46 
MALAYO-POLYNESIAN, 
CENTRAL-EASTERN, 
165#, 345-53 
MALAYO-POLYNESIAN, 
EASTERN, 165ff, 346-53 
MALAYO-POLYNESIAN, 
WESTERN, 165-68, 
338-44 
MALEKULA CENTRAL, 


352 

MALEKULA COASTAL, 
352 

MALEKULA INTERIOR, 


352 
MALUKU, CENTRAL, 167, 
345-46 


MALUKU, SOUTHEAST, 
167, 346 
MAMBILA-VUTE, 105, 311 
МАМЕАМ, 369 
MAMEAN, GREATER, 369 
MAMEE, 103, 311 
MANAM, 347 
МАМСНАТІ, 331 
MANDA, 95, 315 
MANDA-KUI, 330 
MANDA-PENGO, 330 
MANDARA, 321 
MANDARA-MATAKAM, 
321 
MANDARIN-YUE, 331 
MANDAYAN, 341 
MANDE, 81, 83f, 97-101, 
278, 302-3 
MANDEKAN, 303 
MANGAYA-TOGOYO, 306 
MANGBETU-ASUA, 109, 
112, 319 
MANGBUTU-EFE, 112, 319 
MANGERRIAN, 362 
MANGUEAN, 208, 215, 
228f, 237f, 371 
MANGYAN, SOUTH, 340 
MANJAKU-PAPEL, 303 
MANOBO, 340 
MANSAKAN, 341 
MANTHARDA, 189, 364 
MANUBARAN, 178, 182, 
356 
MANUS, 350 
MANZA-NGBAKA, 306 
МАО, 92f, 322 
MAPUDUNGU, 372 
MARAN, 189, 362 
MARANAO-IRANON, 340 
MARI, 363 
MARIC (MARAN), 362 
MARIC (PAMA- 
NYUNGAN), 189, 363 
MARIENBERG, 178, 360 
MARIND, 175, 178, 182, 


355 
MARITIPARANA, 374 
MARKHAM, 167, 348 
MARNGU, 189, 364 
MARQUESIC, 353 
MASA, 92, 320 
MASABA-LUYIA, 314 
MASCOIAN, 241, 375 
MATACO, 241, 375 
MATACO-GUAICURU, 241, 


375 
MATAKAM, 321 
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MATTOLE-WAILAKI, 365 

MATUMBI, 315 

MAYABIC, 363 

MAYAN, 208, 213-30 pas- 
sim, 234f, 253, 368-69 

MAYOGO-BANGBA, 307 

MBANZA-MBANJA, 307 

MBEDE, 313 

MBOSHI, 313 

MBULUNGISH-NALU, 101, 
303 

MBUM, 306 

MBUM-MUNDANG, 102, 
306 

MBUNDU, 314 

MBUNDU, SOUTH, 316 

MEAX, 359 

MEK, 178f, 182, 358 

MEL, 101, 304 

MENCHUM, 312 

MENDE-BANDI, 303 

MENGEN, 349 

MESO-PHILIPPINE, 165, 
167, 278, 340-41 

MEXICAN PENUTIAN, 
214, 228, 235, 368-609 

MIAO, 152, 334 

MIAO-YAO, 126, 143ff, 
148-56, 261, 287, 334 

MICRONESIAN (REMOTE 
OCEANIC), 163, 165, 
167, 351 

MICRONESIAN (SAMOIC 
OUTLIER), 353 

MILNE BAY, 348-49 

MILNE BAY-CENTRAL 
PROVINCE, 167, 348-49 

MIN, 331 

MINANIBAI, 357 

MINDANAO, SOUTH, 165, 
167, 278, 341 

MINDJIM, 357 

MINDORO, NORTHERN, 


339 
MINIAFIA-UBIR, 348 
MINIHASAN, 342 
MIRIWUNGIC, 362 
MISEGIAN, 361 
MISSISSIPPI VALLEY, 233, 


367 
MISUMALPAN, 228, 238, 


371 
MIWOK, 209, 211, 217f, 
368 
MIWOK-COSTANOAN, 
210, 214, 217, 228f, 235, 
368 
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MIXE-ZOQUE, 208, 213-20 
passim, 228, 230, 235, 368 

MIXTEC-CUICATEC, 370 

MIXTECAN, 208, 215, 237, 
370 

MIYAN, 363 

МІ/О, 333 

ММОМС, 336 

MODANG, 343 

MOHAWK-ONEIDA, 233, 
367 

MOIL, 362 

MOKLEN, 3 

MOLALE-CAYUSE, 209, 
214, 368 

MOMBUM, 555 

MOMO, 312 

MOMO-NJEN, 312 

MON-KHMER, 125, 143, 
148—56 passim, 287, 
334-37 

MONDE, 373 

MONDUNGA, 102, 307 

MONGO, 313 

MONGOL-LANGAM, 360 

MONGOLIAN, 125, 129, 
132f, 287, 328 

MONGOLIAN-TUNGUS, 
1275, 131f, 328 

MONGONDOW, 342 

MONGONDOW- 
GORONTALO, 342 

MONGUOR, 133, 329 

MONIC, 151, 155f, 336 

MONTANA, 374 

MONUMBO, 178, збо 

MOREHEAD & UPPER 
MARO RIVERS, 358 

MORU-MADI, 109, 112, 
318 

MOSAN, 213-16, 228-30, 
288, 366 

MOSETEN, 375 

MOSO, 146, 333 

MOTU-SINAGORO, 349 

MUKAWA, 348 

MULLER-SCHWANER 
“PUNAN,” 343 

MUMUYE, 306 

MUMUYE-YENDANG, 102, 
306 

MUNA-BUTON, 342 

MUNDA, 125, 148, 150-56 
passim, 287, 334 

MUNDANG, 306 

MUNDARI, 148, 334 

MUNJI-YIDGHA, 325 


MURA, 238, 371 

MURUT, 342 

MURUT-TIDONG, 342 

MUSKOGEAN, 208-18 pas- 
sim, 234f, 368 

MZAB-WARGLA, 320 


NA, 308 

NA-DENE, 195—200, 204, 
210—23 passim, 228f, 258, 
260f, 288, 365 

NA-DENE, CONTINEN- 
TAL, 365 

NAGA, EASTERN, 146, 


332 
NAGA, NORTHERN, 146, 


332 

NAGA, SOUTHERN, 146, 
332 

NAKANAI, 349 

NAMA, 77f, 81, 116, 301 

NAMBAS, SMALL, 352 

NAMBIKUARAN, 230, 239, 
372 

NAMBU, 358 

МАМАЈ, 133, 329 

NANCERE, 320 

NANDI-MARKWETA, 318 

NASIOI, 361 

NASO, 333 

NATCHEZ-MUSKOGEAN, 
214, 216f, 235, 368 

NATIORO-WARA, 304 

NAVAJO-APACHE, 199, 365 

МАХ, 736, 324 

NDOGO-SERE, 306 

NDONGA, 316 

NDOP, 312 

NDU, 175, 360 

NEMI, 351 

NEW BRITAIN (EAST PA- 
PUAN), 176, 181, 361 

NEW BRITAIN 
(OCEANIC), 167, 349 

NEW CALEDONIA, 167, 


351 
NEW GEORGIA, 167, 181, 


350 

NEW GUINEA HIGH- 
LANDS, EAST, 176-82 
passim, 354—55 

NEW GUINEA, CENTRAL 
& SOUTH, 176-82 pas- 
sim, 355 

NEW HEBRIDES, CEN- 
TRAL, 167, 352 

NEW HEBRIDES, CEN- 


423 


TRAL & NORTHERN, 
167, 181, 352 
NEW HEBRIDES, EAST, 352 
NEW HEBRIDES, NORTH- 
EAST-BANKS ISLANDS, 


352 

NEW HEBRIDES, SOUTH- 
ERN, 167, 350 

NEW IRELAND-TOLAI, 


349 
NEW IRELAND, NORTH- 
ERN, 167, 349 
NEW МЕХІСО PENU- 
TIAN, 235, 368 
NEWARI-PAHRI, 331 
NGADJUNMAYA, 364 
NGALIK-NDUGA, 356 
NGANDO, 313 
NGANGEA-SO, 318 
NGARGA, 189, 364 
NGARINYERIC- 
YITHAYITHIC, 189, 364 
NGAYARDA, 189, 364 
NGBANDI, 306 
NGEMBA, 311 
NGEQ-NKRIANG, 335 
NGERO, 348 
NGILE-DENGEBU, 302 
NGOMBE, 313 
NGUMBIN, 189, 364 
NGUNDI, 313 
NGUNDU-GOLO, 307 
NGUNDU-KPAGUA, 307 
NGUNI, 316 
NGURA, 564 
NICOBAR ISLANDS, 151, 
155f, 337 
NIGER-CONGO, 81-84, 
95-99 passim, 255, 
302-16 
МІСЕК-СОМСО, CEN- 
TRAL, 100, 304-16 
МІСЕК-СОМСО, NORTH 
CENTRAL, 100, 304-7 
NIGER-CONGO, SOUTH 
CENTRAL, 100, 103, 
307—16 
NIGER-KADUNA, 308 
NIGER-KORDOFANIAN, 
76, 83f, 95-106, 258, 260, 
286, 301—16 
NIGER, CENTRAL, 308 
NIGER, LOWER, 100, 103, 


309 

NILO-SAHARAN, 76, 
83-84, 107—13, 253-61, 
286, 317—19 


424 


NILOTIC, 811, 109-12, 
217-18 

МІМВОКАМ, 178, 182, 358 

МІМІСО, 349 

NINZAM, 310 

NINZAM-RUKUBA, 310 

NJEBI, 313 

NJEN, 312 

МКАМВЕ, 311 

МКОМ, 312 

МКОҮА, 315 

NOLGIN, 362 

NOOKSACK, 366 

МОК, 360 

NOR-PONDO, 175, 178, 
360 

NORMAN, 3563 

NORTH SEA, 327 

NORTH-NEHAN, 350 

NORTHERN PLAINS, 233, 


367 
NUBIAN, 110, 112, 286, 
317 
NUER, 109, 318 
NUKUMA, 360 
NUMAGENAN, 357 
NUMIC, 209-14 passim, 
228, 236f, 370 
NUN, 311 
NUN-BAMILEKE, 311 
NUNUSAKU, 345 
NUPE, 308 
NUPE-GWARI, 308 
NURISTANI, 56, 58, 278, 
325 
NURU, 357 
NYAHON-PROUAC, 336 
NYAWAYGIC, 189, 363 
NYIKA, 314 
NYIKA-SAFWA, 315 
NYILAMBA-LANGI, 314 
NYIMA, 103, 310-16 
NYIMANG, 112, 317 
NYO, 307-8 
NYORO-LUGANDA, 314 
NYULNYULAN, 189, 362 
NYUNGAR, 364 
NZEMA-AHANTA, 308 


OAS-DARAGA, 340 

OB-UGRIC, 68, 328 

OCEANIC, 151, 165, 167, 
255f, 288, 347-53 

OCEANIC, REMOTE, 167, 
351-53 

OCORONO, 373 

OGONT, 309 





OI-THE, 336 

OIRAT-KALMYK, 133, 329 

OIRAT-KHALKHA, 133, 
329 

OK, 179, 355 

OMETO, 286, 322 

OMETO-GIMIRA, 322 

OMO-TANA, 323 

OMOSAN, 357 

OMOTIC, 87, 89, 91-93, 
286, 322 

ONGAMO-MAA, 318 

ONIN, 346 

OPONE-CARARE, 375 

OREGON (ATHABAS- 
KAN), 199, 365 

OREGON (PENUTIAN), 
235, 367 

ORINOCO, 374 

OROMOID, 323 

ORON, 309 

OSCO-UMBRIAN, 326 

OTI-VOLTA, 305 

OTO-MANGUEAN, 217-20 
passim, 228-30, 237f, 
289, 370-71 

OTOMIAN, 208, 215, 228, 
2374, 370 

OTTILIEN, 361 

OTUKE, 376 


PACIFIC COAST 
ATHABASKAN, 199, 228 
PACIFIC, CENTRAL, 167, 
176, 181, 352-53 
PACOH-PHUONG, 335 
PAEZAN, 208, 220, 238-39, 


371 
PAEZAN, NUCLEAR, 238, 


371 
PAHOTURI, 358 
РАІ, 369 
PAITANIC, 342 
PAIWANIC, 165ff, 338 
PALAIHNIHAN, 208-18 
passim, 234, 369 
PALAUNG, 334 
PALAUNG-RIANG, 334 
PALAUNGIC, 151, 155, 


334-55 
PALAUNGIC-KHMUIC, 
155f, 334-35 
PALAWAN, 340 
PALEI, 359 
PALKU, 364 
PAMA-NYUNGAN, 187ff, 
362-64 
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PAMAN, 189, 363 
PAMIR, 325-26 
PANGASINIC, 339 
PANO-TACANA, 241, 376 
PANOAN, 208, 229, 241, 
376 
PANTAR, 359 
PAPI, 360 
PAPUAN, EAST, 159, 162, 
172, 174, 178-982, 361 
PAPUAN, WEST, 159, 162, 
172—82 passim, 359 
PARANA, 374 
PARESSI-SARAVECA, 374 
PARJI-GADABA, 330 
РА5ТО, 371 
PATAGONIAN, 239, 372 
PATPATAR-TOLAI, 349 
PAUWASI, 178, 182, 358 
PAWNEE, 232, 367 
PAWNEE-KITSAI, 232, 367 
PEARIC, 155f, 336 
PEBA-YAGUAN, 240, 374 
PEKA, 356 
PENUTIAN, 211-17, 220, 
223, 228-30, 234f, 237, 
288, 367-69 
PERMIC, 68f, 328 
PERSIAN, 326 
PHILIPPINES, CENTRAL, 
168, 340-41 
PHILIPPINES, NORTH- 
ERN, 165, 167, 338-39 
PHILIPPINES, SOUTH- 
ERN, 165, 167, 339-40 
PHO, 331 
PHO-PHLON, 331 
PHUNOI, 333 
PIAPOCAN, 374 
PIAROA-SALIBA, 240, 372 
PIHOM, 357 
PIHOM-ISUMRUD- 
MUGIL, 357 
PIMAN, 209, 237, 370 
РІМІС, 209, 214, 2364, 370 
РІКО, 370 
PIRU BAY, 345-46 
PLATEAU (МҮІМА), 103, 
278,310 
PLATEAU (PENUTIAN), 
214, 255, 278, 368 
POCHUTLA, 370 
POCOM, 369 
POGORO, 314 
POK-TERIK, 318 
POLYNESIAN, 165, 165, 


255, 255, 352-53 
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POLYNESIAN, EASTERN, 
165, 353 
POLYNESIAN, NUCLEAR, 


353 
РОМО, 209-18 passim, 
229, 234, 369 
POMOIKAN, 357 
PONAPEIC, 351 
PONAPEIC-TRUKIC, 351 
PONDO, 360 
POPAYAN, 371 
POPOLOCAN, 237f, 
370-71 
POTU, 308 
PRE-ANDINE MAIPU- 
RAN, 374 
PUINAVE-MAKU, 239, 372 
PURI, 241, 376 


QAWASQAR-YAMANA, 
239, 372 

QUECHUAN, 239, 372 

QUICHEAN, 228, 369 

QUICHEAN, GREATER, 
369 


RAGLAI, 344 

RAI COAST (MADANG- 
ADELBERT RANGE), 
175, 179, 356-57 

RAI COAST (RAI COAST- 
NORTHWEST NEW 
BRITAIN), 175, 179, 348 

RAI COAST-NORTHWEST 
NEW BRITAIN, 175, 179, 
348 

RAJA ЕМРАТ, 347 

RAM, 360 

RAMA, 238, 371 

КАМАКАМА, 373 

RAMU, 1784, 360—61 

RASHAD, 99, 302 

RAUTO, 349 

RAWANG, 333 

REEF ISLANDS-SANTA 
CRUZ, 1784, 361 

REJANG-BARAM, 343 

REJANG-BINTULU, 343 

REJANG-SAJAU, 343 

REJANG, LOWER, 343 

RENGMA, 332 

RHAETO-ROMANCE, 326 

RIFT, 323 

RING, 312 

RIO NEGRO, 374 

RITWAN, 211--17 passim, 
222, 228—31, 288, 366 


ROINJI-NENGAYA, 348 

ROMANCE, 30—31, 33, 37, 
57, 59, 326 

ROMANY, 38, 325 

ROMBLON, 341 

RON, 321 

ROTOKAS, 361 

ROTUMAN-FIJIAN, 352 

RUBONI, 361 

RUNGA-KIBET, 317 

RUNGU, 315 

RUSSIAN RIVER, 369 

RWANDA-RUNDI, 314 


SAAMIC, 33, 68f, 278, 328 
SAHAPTIN-NEZ PERCE, 
208, 217, 368 
SAHARAN, 81, 84, 108-9, 
111f, 278, 317 
SALAMPASU-NDEMBO, 


315 
SALINAN-CHUMASH, 
211, 214, 234, 369 
SALISH, 208-20 passim, 
228—532. passim, 288, 366 
SAM, 323 
SAMA-BAJAW, 165, 167f, 


343-44 

SAMA, INNER SULU, 343 

SAMA, WESTERN SULU, 
345 

SAMAR-WARAY, 341 

SAMBALIC, 339 

SAMO, 303 

SAMOIC OUTLIER, 353 

SAMOYED, 65—70 passim, 
125, 129, 328 

SAMRE, 336 

SAN CRISTOBAL, 351 

SANAGA, 312 

SANGIR-MINAHASAN, 
342 

SANGIRIC, 342 

SANGLECHI-ISHKASHMI, 
325 

SANGTAM, 332 

SANIO, 360 

SANTA CRUZ (EAST PA- 
PUAN), 176, 181, 361 

SANTA CRUZ (OCEANIC), 
167, 176, 181, 350 

SANTA ISABEL, 167, 350 

SANTALI, 148, 334 

SANTO, EAST, 352 

SANTO, WEST, 352 

SAPOSA-TINPUTS, 350 

SARA, 318 
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SARA-BAGIRMI, 318 
SARDINIAN, 326 
SARMI, 347 
SARMI-YOTAFA, 167, 347 
SCYTHIAN, EAST, 128, 
325-26 
SCYTHIAN, WEST, 325 
SEDANG, 336 
SEDANG-TODRAH, 336 
SEMANG, 155, 337 
SEMANG-JAH HUT, 337 
SEMANG-SENOIC, 337 
SEMITIC, 77, 81-92 pas- 
sim, 253, 259f, 286, 323 
SEMNANI, 326 
SENAGI, 178f, 182, 358 
SENECA-CAYUGA, 233, 367 
SENECA-ONONDAGA, 
233, 367 
SENEGAL, 303 
SENEGAL, EASTERN- 
GUINEA BISSAU, 101, 


303 
SENGA-SENA, 315 
SENOIC, 337 
SENTANI, 178, 182, 356 
SENUFO, 304 
SENUFO-TUSYA, 102, 304 
SEPIK (SEPIK-MADANG), 


347 

SEPIK (SEPIK-RAMU), 178, 
181, 360-61 

SEPIK HILL, 179, 360 

SEPIK-MADANG, 347 

SEPIK-RAMU, 178-82 pas- 
sim, 360 

SEPIK, MIDDLE, 360 

SEPIK, UPPER, 175, 360 

SERAM, 345 

SERAM STRAITS, 346 

SERAM, CENTRAL, 345 

SERAM, EAST, 345 

SERAM, NORTHWEST, 345 

SERAN LAUT, 345 

SERE, 306 

SERI-YUMAN, 214, 217, 
228, 234, 369 

SERRAN, 237, 370 

SGAW, 331 

SGAW-BWE, 331 

SHAMBALA, 314 

SHI-HAVU, 314 

SHINA, 325 

SHO, 333 

SHONA, 316 

SHUGHNI-YAZGULAMI, 
326 
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SIANE, 354 

SIASSI, 167, 347-48 
SIBERIAN YUPIK, 329 
SIDAMO, 322 

SIERRA DE PERIJA, 375 
SIKAN, 357 

SIMI, 532 
SINHALESE-MALDIVIAN, 


325 
SINICIZED PAIWANIC, 338 
SINITIC, 143f, 146, 150, 331 
SINO-TIBETAN, 126, 141- 
47, 150, 260f, 287, 331-33 
SIONA-SECOYA, 372 
SIOUAN, 2086, 214-20 pas- 
sim, 229, 232f, 367 (| 
SIOUAN-YUCHI, 213f, 230, 
288, 367 
SIRA, 313 
SKO, 175-82 passim, 359 
SLAVIC, 33, 376, 42, 46, 
54—60 passim, 253, 257, 


327 
SMALL NAMBAS, 352 
SO-BRU, 335 
SOGDIAN, 325 
SOKORO, 320 
SOLOMONS, CENTRAL, 
175f, 181, 361 
SOLOMONS, SOUTH- 
EAST, 167, 176, 181, 351 
SOMALI-AWEER, 323 
SOMRAI, 320 
SONGHAI, 84, 111, 317 
SONGYE, 315 
SONINKE-BOZO, 302 
SORA-GORUM, 334 
SORA-JURAY, 334 
SOTHO-TSWANA, 316 
SRE-MNONG, 336 
ST. MATTHIAS, 349 
STIENG-CHRAU, 335 
STRICKLAND, EAST, 355 
SUAU, 348 
SUBANUN, 339 
SUBIA, 315 
SUDANIC, CENTRAL, 
81-84 passim, 110-12, 
18-1 
SUDANIC, EAST, 81-84, 
110ff, 278, 317-18 
SUKUMA-NYAMWEZI, 314 
SULA, 345 
SULA-TALIABO, 345 
SULAWESI, CENTRAL, 342 
SULAWESI, SOUTH, 342 


SULU-BORNEO, 343 
SUMATRA, 344 
SUNDIC, 165, 1674, 344 
SUPPIRE-MIANKA, 304 
SURIGAO, 341 
SURMA, 112, 317 
SUSU-YALUNKA, 302 
SWAHILI, 314 


TA-ARAWAKAN, 374 

TA-OY-TONG, 335 

ТАҒА, 119, 301 

TACANAN, 208, 228, 241, 
278, 376 

TAGALOG, 340 

TAGBANA-DYIMINI, 304 

TAHITIC, 353 

TAHLTAN-KASKA, 199, 
365 

TAI, 15, 143ff, 150—54 pas- 
sim, 337-38 

TAIKAT, 358 

TAIRORA, 354 

TAKELMA, 367 

TAKELMAN, 367 

ТАКІС, 228, 236f, 370 

TALAMANCA, 238, 371 

ТАГООІ, 99, 302 

TALYSH, 326 

TAMA (EAST SUDANIC), 
112, 317 

TAMA (SEPIK), 360 

TAMA-SUNGOR, 317 

TAMANG, 331 

TAMIL, 138f, 330 

TAMIL-IRULA, 330 

TAMIL-KANNADA, 139ff, 


330 
TAMIL-KODAGU, 139, 330 
TAMOLAN, 361 
TANAINA-AHTNA, 199, 
365 
TANANA, 365 
TANANA-UPPER 
KUSKOKWIM, 199, 365 
TANGALE, 321 
TANGIC, 189, 362 
TANNA, 350 
TANOAN, 208f, 213-23 
passim, 228ff, 236, 289, 


370 

TARACAHITIC, 209, 236f, 
370 

TARAHUMARA- 
GUARIJIO, 237, 370 

TARIANAN, 374 
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TAROK, 310 
TASMANIAN, 176, 180-81, 
188, 354 
TATAR, 33, 128f, 328 
TATO, 110, 318 
TATUYO, 372 
TAUPOTA-WEDAU, 348 
TEBERAN, 357 
TEBERAN-PAWAIAN, 178, 
182, 357 
TEKE, 313 
TELUGU, 140, 330 
TELUGU-KUI, 139ff, 330 
TEMEIN, 81, 112, 317 
TEMNE, 304 
TENETEHARA, 373 
TEPEHUAN, 237, 370 
TEQUISTLATECAN, 208, 
213—19 passim, 228, 234, 
369 
TERA, 320 
TESO-TURKANA, 318 
TETELA, 313 
TEWA-TIWA, 236, 370 
THAN, 363 
THREE RIVERS, 345 
TIATINAGUA, 376 
TIATINAGUA-TACANA, 
376 
TIBETAN, 128, 146, 331 
ТІВЕТІС, 146, 331—532 
TIBETO-BURMAN, 143-46 
passim, 150, 331—335 
TIBETO-KAREN, 144, 
331—33 
TIBORAN, 357 
TICUNA-YURI, 239, 372 
TIDONG, 342 
TIKAR, 312 
TILLAMOOK, 366 
TIMOR, 175, 346 
TIMOR-ALOR-PANTAR, 
176—82 passim, 278, 359 
TIMOR-FLORES, 346 
TIMOTE, 208, 240, 372 
TINIGUAN, 240, 373 
TIRIO, 358 
TIV-BATU, 103, 311 
TIWA, 236, 289, 370 
TIWA-PIRO, 236, 370 
TLAPANECAN, 213-19 
passim, 228, 234, 369 
TOCHARIAN, 35-36, 56ff, 
125, 286, 325 
TODA-KOTA, 138f, 330 
TODRAH, 336 
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TODRAH-MONOM, 336 

TOGO, 98, 103, 308 

TOGO, CENTRAL, 103, 308 

TOLOWA-GALICE, 365 

ТОМІМІ, 342 

TONDA, 358 

TONGIC, 352 

TONGWE, 314 

TOR, 358 

TOR-LAKE PLAIN, 358 

TORRICELLI, 75-82 pas- 
sim, 359—60 

TOTONACAN, 208, 213-23 
passim, 228, 230, 235, 368 

TRANS-FLY, 179, 358 

TRANS-FLY-BULAKA 
RIVER, 178, 182, 358 

TRANS-FLY, EASTERN, 358 

TRANS-NEW GUINEA, 
177-83, 256, 354-59 

TRUKIC, 351 

TSAMOSAN, 232, 366 

TSETSAUT, 365 

TSHU-KHWE, 301 

TSOGO, 313 

TSOUIC, 165ff, 338 

TSWA-RONGA, 316 

TUAREG, 93, 286, 320 

TUCANOAN, 220, 229, 239, 
372 

TULU, 140, 330 

TUNEBO, 371 

TUNGUS, 125-33 passim, 
287, 329 

TUNICA-ATAKAPA, 214, 

368 

TUNICA-CHITIMACHA, 
214, 216f, 223, 235, 368 

TUPARI, 373 

TUPI, 191, 208, 220, 240, 


373 
TUPI-GUARANI, 289, 373 
TURAMA-KIKORIAN, 
178f, 182, 357 
TURAMA-OMATIAN, 357 
ТОККІС, 125--33 passim, 
287, 328 
ТОККІС, СОММОМ, 133, 
328 
TURKWAM, 310 
TUSCARORA, 233, 367 
TUTCHONE, 199, 365 
TUVA-ALTAI, 328 
TUVA-KARAGAS, 328 
TZELTALAN, 368 
TZELTALAN, GREATER, 368 


UBANGIAN, 306-7 

UDIHE, 133, 329 

UGRIC, 65, 68, 70, 328 

ULAT ІМАІ, 345 

ULIASE, 346 

UMPOUA, 365 

UPLAND YUMAN, 369 

URALIC, 24, 66—7o passim, 
125, 129, 258f, 261, 328 

URALIC-YUKAGHIR, 64- 
71, 258, 280, 287, 328 

URARINA-WAORANI, 239, 

371 

URTWA, 364 

URUWA, 354 

UTO-AZTECAN, 205-13 
passim, 217ff, 228ff, 2366, 
289, 370 

UTUPUA, 350 


VAI-KONO, 303 
VANIKORO, 350 
VANIMO, 359 
VAYU-CHEPANG, 331 
VIET-MUONG, 151f, 155f, 


335 
VOLGAIC, 69, 328 
bc ie 103, 
7-8 
VOLTAIC—>GUR, 83, 
95-102 passim, 304 


WA-LAWA, 335 

WADJARI, 189, 364 

WAGAYA-WARLUWARIC, 
189, 364 

WAGAYDY, 362 

МАНСІ, 355 

УУАТВГК, збо 

WAIC, 335 

WAICURI-OUINIGUA, 234, 


369 
WAIMA’ A, 346 
WAJA, 102, 305 
WAKA-KABIC, 189, 363 
WAKASHAN, 209-18 pas- 
sim, 228ff, 232, 288, 366 
WALING, 332 
WALIO, 360 
WAMBAYAN, 362 
WANANG, 357 
WANTOAT, 354 
WANYAM, 373 
WAEEI, 359 
WAPEI-PALEL 178, 359 
WAPEI, WEST, 178, 359 
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WAPISHANAN, 374 

WARAYAN, 341 

WARIS, 358 

WARJI, 322 

WARLUWARIC, 364 

WARUP, 354 

WASHINGTON PENU- 
TIAN, 235, 367 

WATI, 189, 364 

WATUT, LOWER, 348 

WE, 504 

WEL, 103, 310—16 

WEST ATLANTIC, 83, 97f, 
100, 305—4 

WEST BARKLY, 188f, 362 

WESTERN-GURMA, 305 

WHITEMAN, 349 

IWI, 119, 301 

WILLAUMEZ, 349 

WINTUN, 209-18 passim, 
235, 368 

WIRADHURIC, 189, 363 

WIRINA, 374 

WISSEL LAKES-KEMAN- 
DOGA, 178, 182, 356 

WITOTOAN, 289, 375 

WOGAMUSIN, збо 

WORORAN, 362 

WURBO, 309 


XINGU, 374 
XINGU BASIN, 375 


YAAKUAN-DULLAY, 322 

YABIM-BUKAWAC, 348 

YABUTI, 241, 376 

YAGANON, 356 

YAKA, 314 

YAKONAN, 209, 214, 217, 
367 

YAKUT, 328 

YALANJIC, 189, 363 

YAMINAHUA-SHARANA- 
HUA, 376 

YANAN, 208-18 passim, 
234, 369 

YANGMANIC, 362 

YANOMAM, 238, 571 

YANZI, 313 

YAO (BANTU, ZONE D), 


315 
YAO (MIAO-YAO), 152, 
154, 334 
YAPEN, 346 
YAPURA A, 374 
YAPURA В, 374 
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ҮАОАҮ,355 
ҮАКЕВАМ, 178, 182, 356 
YARLI, 364 
YELE-SOLOMONS, 178, 


361 
YELE-SOLOMONS-NEW 
BRITAIN, 361 
YELLOW RIVER, 360 
YENDANG, 306 
YESKWA, 310 
YI, 307-8 
YIDINYIC, 189, 363 
YIWAIDJAN, 1884, 362 
YIWAIDJIC, 362 
YOKUTS, 209-18 passim, 
235, 368 
YOM-NAUDEM, 305 
YORUBA, 100, 308 
YORUBA-NORTHERN 
AKOKO, 103, 308 


YOTAFA, 347 
YOTAYOTIC, 189, 363 
YUAT, 360 
YUAT-MARAMBA, 360 
YUAT-WAIBUK, 360 
YUCATECAN, 228, 368 
YUIN, 363 
YUIN-KURIC, 189, 363 
YUKAGHIR, 70, 126, 328 
YUKI-WAPPO, 209, 217f, 
228, 235, 368 
YUKUBEN-KUTEB, 309 
YUMAN, 208-18 passim, 
228f, 234, 369 
YUMBRI-MRABRI, 335 
YUNGUR, 102, 306 
YUNGUR-ROBA, 306 
YUPIK, 194, 329 
YUPNA, 354 
YURA, 189, 364 
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YURUNA, 373 
YUULNGU, 189, 362 


ZAAR, 322 
ZAMUCOAN, 240, 373 
ZAN, 324 
ZANDE, 102, 307 
ZANDE-NZAKARA, 307 
ZAPAROAN, 208, 371 
ZAPOTECAN, 208, 228, 
2374, 257, 371 
ZAZA-GORANI, 326 
ZENATI, 95, 320 
ZENATI, EAST, 320 
ZIGULA-ZARAMO, 314 
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This index contains all of the roughly 5,000 languages classified іп Chapter 8. Тһе 
number following each language indicates the page on which that language appears 
in the classification (section 8.6). When more than one number follows a language 
the final number indicates the page number іп the classification, the others are ref- 
erences in the text of the book. 

Compound language names are alphabetized according to the following prin- 
ciples. (1) If the first word is a common adjective, then the language is alphabetized 
according to the second word (e.g. “Eastern Ротпо” is to be found under “Pomo, 
Eastern” and “Alaskan Inuit” under “Inuit, Alaskan”). (2) If both words appear 
meaningless to the average reader, the language name is alphabetized according to 
the first word (e.g. "Abai Sungai” and “Jaya Bakti” are listed as such in the index). 
(3) In a limited number of cases a language name is listed under both its first and 
second words (e.g. both "Kelinyao Kenyah" and "Kenyah, Kelinyao" appear in the 
index). This dual approach is followed because there are in fact other "Kenyah" 
languages, even though to most people both terms seem equally opaque. 

It is not uncommon for two (or more) entirely different languages to have the 
same name. When they are genetically and/or geographically close to each other 
scholars sometimes distinguish them by subscripts (e.g. Mimi, and Mimi,, Mor, 
and Mor,). Usually, however, homonymous names are not even noticed, much less 
distinguished. For example, in addition to the fairly well-known Yao (= Mien) lan- 
guage of Southeast Asia, there are two other languages called Yao in the classifica- 
tion, one a member of the Bantu family in Africa, the other belonging to the Carib 
family of South America. Homonymous names are distinguished in the index by 
following each with its family affiliation, in this case: 


Yao (BANTU) 
Yao (CARIB) 
Yao (MIAO-YAO) 


More perplexing than the problem of identical names is the fact that in some in- 
stances the name most often associated with a language is, for reasons discussed in 
section 8.1, replaced by a different name in the classification. I have tried to antici- 
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pate where such replacements are likely to confuse the average reader and have pro- 
vided cross references in such cases. Thus, “Gypsy—> Romany, 286, 325” means that 
the language of the Gypsies is called Romany and is listed as such on those pages. I 
have, however, used this system of cross-references sparingly and only for the most 
common variants. This is an index to the world's languages as listed in section 8.6. 
Both Voegelin and Voegelin (1977) and Grimes (1984) provide indexes to the more 
than 25,000 names associated with the world's languages and their dialects and ei- 
ther book may be used as a more general index to this volume to identify other lan- 
guage and dialect names. 


Grimes, Barbara F., ed. 1984. Index to the Tenth Edition of Ethnologue: Languages of the 
World. Dallas. 
Voegelin, C. Е, and F. M. Voegelin. 1977. Classification and Index of the World's Lan- 


guages. New York. 


A-Pucikwar, 354 Adele, 308 Ahtna, 365 
Abaga, 354 Adere, 311 Aibondeni, 347 
Abai Sungai, 342 Adhola, 317 Aiku, 359 
Abaknon, 343 Adi, 531 Aimol, 332 
Abanyom, 311 Adjora, 360 Ainu, 125—32 passim, 259, 
Abasakur, 357 Adygh, 324 261, 287, 329 
Abau, 360 Adynyamathanha, 364 Aiome, 360 
Abaza, 74, 324 Adyukru, 308 Aion, 360 
Abe, 308 Adzera, 348 Airo, 355 
Abelam, 360 Aeka, 356 Airoran, 356 
Abenaki, Eastern, 231, 366 Afar, 322 Aiwo, 361 
Abenaki, Western, 366 Afrikaans, 37, 60, 76, 287, Aizi, 102, 304 
Abia, 356 327 Aja, 318 

Abiji, 308 Agala, 355 Ajie, 351 
Abipon, 375 Agar, 318 Ak, 360 
Abon, 311 Aghem, 312 Aka, 317 
Abron, 308 Aghu, 355 Aka-Bea, 354 
Abua, 309 Aghu-Tharnggala, 365 Aka-Bo, 354 
Abui, 359 Agi, 359 Aka-Cari, 354 
Abure, 308 Agób, 358 Aka-Jeru, 354 
Abxaz, 74, 324 Agoi, 309 Aka-Kede, 354 
Acatec, 369 Agta, 338 Aka-Kol, 354 


Achagua, 374 
Achang, 333 


Agta, Southeast Cagayan, 
338 


Aka-Kora, 354 
Akan, 95, 308 


Achi, 369 Aguacatec, 369 Akani, 329 

Achinese, 161, 344 Aguaruna, 373 Akar-Bale, 354 

Achode, 308 Agul, 324 Akawaio, 375 

Acholi, 318 Agusan, 340 Ake, 310 

Achuar, 373 Agutaynon, 340 Akei, 352 

Achumawi, 369 Agwaagwune, 309 Akha, 353 

Acroa, 377 Agwamin, 363 Akkadian, 55, 90, 92, 323 
Adai, 218, 367 Ahanta, 308 Aklanon, 341 

Adangme, 308 Ahi, 333 Ako, 333 

Adasen, 338 Ahlo, 308 Akoko, Northern, 308 
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Akpa, 308 

Akpafu, 308 

Akrukay, 361 

Akwa, 313 

Akweto, 312 

Alabama, 205, 368 

Alacaluf>Oawasgar, 278, 
372 

Аі ап, 308 

Alago, 308 

Alagwa, 323 

Alak, 336 

Alamblak, 360 

Alangan, 339 

Alas, 344 

Alawa, 362 

Albanian, 33, 36, 46, 54, 
56ff, 286, 326 

Alege, 310 

Aleut, 126, 193, 287, 329 

Alfendio, 360 

Ali, 347 

Allentiac, 371 

Alo-Tegel, 352 

Alsea, 218, 367 

Altai, 328 

Altai, Northern, 328 

Alu, 359 

Alune, 345 

Alur, 348 

Alutiiq, 329 

Alyawarra, 364 

Alyutor, 136, 329 

Ama, 359 

Amadi, 3o7 

Amahai, 346 

Amahuaca, 376 

Amaimon, 357 

Amal, 36o 

Amanab, 358 

Amanaye, 373 

Amarakaeri, 374 

Атпаві, 311 

Ambai, 347 

Ambasi, 356 

Ambelau, 345 

Amber, 347 

Amberbaken, 178, 359 

Amblong, 352 

Ambrym, North, 352 

Ambrym, Southeast, 352 

Amele, 357 

Amharic, 86, 286, 323 

Amho, 376 

Ami, 161, 338 

Amo, 309 


Ampanang, 343 
Amto, 359 
Amuesha, 374 
Amurag, 362 
Amuru, 322 
Amuzgo, 237f, 370 
Amwi, 334 
Anaang, 309 
Anaguta, 309 
Anakalangu, 346 
Anal, 332 
Anambe, 373 
Ancient Egyptian, 84, 88f, 
91, 286, 320 
Andagui, 238, 371 
Andarum, 361 
Andi, 324 
Andio, 342 
Andoa, 371 
Andoke, 240, 374 
Andra, 350 
Andro, 333 
Aneityum, 350 
Anej, 319 
Anem, 361 
Anfillo, 322 
Angaataha, 355 
Angal, 355 
Angami, 332 
Angas, 321 
Angaua, 357 
Anggor, 358 
Angku, 335 
Angoram, 360 
Animere, 308 
Ankave, 355 
Anor, 360 
Anserma, 371 
Ansus, 347 
Antillean Creole, Lesser, 
377 
Anuak, 317 
Anuki, 348 
Anyanga, 308 
Anyi, 308 
Ao, 332 
Aoban, Northeast, 352 
Aoheng, 343 
Aore, 352 
Apache, Eastern, 197, 288, 
365 
Apache, Kiowa, 365 
Apache, Western, 199, 365 
Apalachee, 368 
Apalai, 375 
Apatani, 331 
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Apiaca, 375 

Apiaka 373 

Apinaye, 377 

Apma, 352 

Apurina, 374 

Arabana, 364 

Arabela, 371 

Arabian, Epigraphic South, 


323 
Arabian, South, 92, 323 
Arabic, Classical, 323 
Arabic, Eastern, 286, 323 
Arabic, Western, 323 
Aracaju, 375 
Araki, 352 
Aramaic, 90, 92, 286, 323 
Aramaic, Old, 323 
Aramo, 360 
Aranda, 364 
Aranda, Lower, 364 
Arandai, 358 
Araona, 376 
Arapaho, 231, 366 
Arapesh, Mountain, 359 
Arapesh, Southern, 359 
Arara (CARIB), 375 
Arara (ТОРІ), 373 
Arara (UNCLASSIFIED), 
377 
Arasa, 376 
Araua, 376 
Araucanian—>Mapudungu, 
204, 208, 239, 372 
Aravira, 376 
Arawa, 373 
Arawak, 289, 374 
Arawe, 349 
Arawete, 373 
Arawum, 356 
Arbore, 323 
Archi, 324 
'Are'are, 351 
Arem, 335 
Argobba, 323 
Arguni, 346 
Arha, 351 
Arho, 351 
Ari, 322 
Aricapu, 376 
Aricobe, 377 
Arigibi, 358 
Arikara, 232, 367 
Arikem, 373 
Arinagoto, 375 
Aritinngithigh, 363 
Arma, 371 
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Armenian, 46, 54 56ff, 286, 
325 

Armenian, Classical, 325 

Arosi, 351 

Aru, 359 

Arua (MAIPURAN), 374 

Arua (TUPI), 373 

Aruek, 359 

Arum, 310 

Arumanian, 59, 326 

Aruop, 359 

Arutani, 372 

AS, 347 

Asa, 323 

Asas, 356 

Asat, 354 

Ashaganna, 310 

Ashaninca, 374 

Asheninca, 374 

Ashkun, 325 

Ashuku, 309 

Asienara, 355 

Asilulu, 345 

Asmat, Casuarina Coast, 
288, 355 

Asmat, Central, 288, 355 

Asmat, Citak, 288, 355 

Asmat, North, 288, 355 

Asong, 333 

Assaka, 311 

Assamese, 325 

Assyrian, 323 

Asua, 319 

Asumboa, 350 

Asuri, 334 

Asurini, 373 

Ata, 340 

Atakapa, 207, 213, 218, 368 

Atamanu, 345 

Atampaya, 363 

Atanque, 371 

Atayal, 338 

Atchin, 352 

Atemple, 357 

Aten, 310 

Athpare, 332 

Atinggola, 342 

Atsahuaca, 376 

Atsugewi, 369 

Atta, 338 

Attie, 308 

Atuot, 318 

Aturu, 358 

Au, 359 

+ Au.//ei, 301 

Auishiri, 239, 372 


Aulua, 352 

Aushi, 315 

Autu, 360 

Auyana, 354 

Ауа, 373 

Avar, 324 

Avatime, 308 

Avestan, 325 

Avikam, 308 

Avukaya, 318 

Awa, 354 

Awabakal, 186, 363 

Awadhi, 325 

Awak, 305 

Awar, 360 

Aweer, 278, 325 

Aweti, 373 

Awin, 355 

Awing, 311 

Awjila, 93, 320 

Awngi, 322 

Awngthim, 365 

Awun, 360 

Awutu, 408 

Амууі, 358 

Axamb, 352 

Ахуах, 324 

Ayabadhu, 363 

Aymara, 31, 204, 208, 289, 
372 

Ayoman, 371 

Ayoreo, 373 

Ayu, 310 

Azerbaijani, 127, 287, 328 

Aztec Nahuatl, 204, 289, 
370 


Baagandji, 189, 363 
Baba, 311 
Babaju, 311 
Babalia, 318 
Babanki, 312 
Babatana, 350 
Babine, 365 
Babuantu, 311 
Babungo, 312 
Baburiwa, 358 
Babuyan, 339 
Babuza, 338 
Bacham, 311 
Bachingu, 311 
Bactrian, 325 
Bada, 542 
Badaga, 330 
Bade, 321 
Badjiri, 364 
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Badyara, 303 

Baetora, 352 

Bafo, 312 

Bafumen, 312 

Bafussam, 311 

Bafut, 311 

Baga Binari, 304 

Baga Koga, 304 

Baga Maduri, 304 

Baga Mboteni, 303 

Baga Sitemu, 304 

Baga Sobane, 304 

Bagam, 311 

Bagirmi, 318 

Bago, 305 

Bagobo, 341 

Bagulal, 324 

Bagupi, 357 

Baham, 356 

Bahau, 343 

Bahau River Kenyah, 343 

Bahinemo, 360 

Bahing, 332 

Bahnar, 336 

Bai (SINITIC), 331 

Bai (UBANGIAN), 306 

Baibai, 359 

Baimak, 357 

Bainapi, 355 

Baining, 361 

Baka (CENTRAL 
SUDANIC), 318 

Baka (UBANGIAN), 307 

Bakairi, 375 

Baki, 352 

Bakumpai, 343 

Bakung Kenyah, 343 

Bakwe, 304 

Balaesan, 342 

Balangaw, 339 

Balant, 303 

Balantak, 342 

Bale, 317 

Balendru, 112, 319 

Bali (BANTU, ZONE B), 
161, 313 

Bali (BANTU, ZONE D), 
161, 313 

Bali (SUNDIC), 344 

Balong, 312 

Baluan, 350 

Baluchi, 38, 326 

Baluj, 325 

Balum, 311 

Bam (FINISTERRE), 354 

Bam (SEPIK), 347 
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Bamaha, 311 
Bamali, 312 
Bambalang, 311 
Bambara, 95, 303 
Bambeshi, 322 
Bambui, 311 
Bambuka, 306 
Bamendiinda, 311 
Bamenkombit, 311 
Bamenyam, 311 
Bamessi, 312 
Bamessing, 312 
Bamu Kiwai, 358 
Bamugun, 311 
Bamun, 311 
Bamunka, 312 
Bana, 321 
Banaro, 360 
Banawa, 373 
Banda, 345 
Bandawa, 309 
Bandeng, 511 
Bandi, 303 
Bandjalang, 362 
Bandobo, 312 
Bangando, 306 
Bangangte, 311 
Bangba, 307 
Banggai, 342 
Banggarla, 364 
Banggi, 340 
Bangubangu, 315 
Bangui, 312 
Bangwa, 311 
Baniata, 361 
Baniva, 374 
Baniwa, 374 
Banjari, 38, 325 
Banjun, 311 
Bankon, 312 
Banoni, 350 
Banta, 304 
Bantayan, 341 
Bantik, 342 
Banton, 341 
Banyun, 303 
Bapi (BAMILEKE), 311 
Bapi (NUN), 511 
Bapu, 359 
Baraba, 328 
Barai, 356 
Barain, 320 
Barakai, 346 
Baram Kayan, 345 
Barama, 313 
Barambo, 307 


Barasano, Northern, 372 

Barasano, Southern, 372 

Barau, 358 

Barawa, 322 

Barbacoas, 571 

Bare, 374 

Bari, 82, 109, 318 

Bariai, 348 

Bariba, 102, 304 

Bariji, 356 

Barim, 348 

Barok, 349 

Baropasi, 359 

Barrow Point, 362 

Baruga, 356 

Barunggam, 363 

Baruya, 355 

Basaa, 312 

Basap, 343 

Basari (GUR), 305 

Basari (WEST ATLANTIC), 
303 

Basay, 338 

Bashar, 310 

Bashkardi, 326 

Bashkarik, 325 

Bashkir, 127, 287, 328 

Basila, 308 

Basketo, 322 

Basque, 24, 33, 128f, 257, 
377 

Bassa, 304 

Bassa-Kaduna, 310 

Bassa-Kontagora, 310 

Bassa-Kuta, 310 

Bassa-Kwomu, 310 

Bata, 321 

Batak, 161, 340 

Batak, Karo, 344 

Batak, Toba, 344 

Batanta, 347 

Bateg Пед, 337 

Bateg Nong, 337 

Bati (BANTU, ZONE C), 


312 

Bati (GRASSFIELDS 
BANTU), 311 

Batie, 311 

Bats, 324 

Batu, 311 

Batuan, 343 

Batumerah, 345 

Bau, 357 

Bauchi, 310 

Baudo, 371 

Baukan, 342 
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Baule, 308 

Baure, 374 

Bauro, 351 

Bauwaki, 356 

Bauzi, 359 

Bay, Southwest, 352 

Bayali, 363 

Bayanga, 307 

Bayot, 303 

Bayso, 323 

Bayungu, 364 

Be, 337 

Beami, 355 

Beaufort Murut, 342 

Beaver, 365 

Bebe, 312 

Bebele, 312 

Bedik, 303 

Bedoanas, 346 

Befang, 312 

Bega, 319 

Beja, 89, 91, 322 

Bekati’, 343 

Bekwarra, 310 

Bele, 321 

Beli (CENTRAL 
SUDANIC), 318 

Beli (TORRICELLI), 359 

Bella Coola, 232, 366 

Bellari, 140, 330 

Bemal, 357 

Bemba, 95, 315 

Bembe (BANTU, ZONE D), 


313 

Bembe (ВАМТО, ZONE Н), 
314 

Bena, 314 

Benabena, 354 

Вепсһо, 322 

Bende, 314 

Bendi, 310 

Bendiapa, 372 

Benga, 312 

Bengali, 38, 42, 286, 325 

Benyadu', 343 

Вео, 313 

Beothuk, 2076, 214, 216, 
366 

Верош, 357 

Berawan, 343 

Вегік, 358 

Berta, 81, 834, 108f, 111f, 
319 

Berti, 317 

Bete, 304 

Betoi, 238, 371 
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Bezhta, 74, 324 
Bhili, 38, 325 
Bhojpuri, 325 
Bhramu, 351 
Biafada, 303 
Biak, 347 

Biaka, 359 

Bian Marind, 355 
Biangai, 556 
Biat, 336 

Biatah, 343 
Biate, 332 
Bichole, 347 
Bidiyo, 320 
Bidjara, 364 
Bidyara, 363 
Bierebo, 352 
Bieri, 305 

Bieria, 352 

Big Nambas, 352 
Bijago, 101, 303 
Bikaru, 360 
Biksi, 360 
Bilaan, Cotabato, 341 
Bilaan, Sarangani, 341 
Bilakura, 357 
Bilbil, 347 

Bile, 311 

Bileki, 349 
Biliau, 347 

Bilin, 320 

Biloxi, 205, 233, 367 
Biltine, 112, 317 
Bilua, 361 

Bima, 346 

Bimin, 355 

Віпа, 349 
Binahari, 356 
Binandere, 356 
Binawa, 309 
Bine, 358 

Bini, 309 
Binigura, 364 
Binja, North, 313 
Binja, South, 313 
Bintauna, 342 
Bintulu, 343 
Binukid, 340 
Binumarien, 354 
Bira, 515 

Birale, 322 
Birao, 351 

Birar, 329 

Birgid, 317 
Birhor, 334 

Вігі, 363 


Birom, 278, 310 

Birria, 363 

Bisa (BANTU), 315 

Bisa (MANDE), 303 

Bisaya, Northern, 342 

Bisaya, Southern, 342 

Biseni, 3o7 

Bisis, 36o 

Bislama, 377 

Bisu, 333 

Bitara, 36o 

Bitare, 311 

Bitieku, 311 

Biwat, 36o 

Biyom, 357 

Blablanga, 35o 

Blackfoot, 231, 288, 366 

Blagar, 359 

Bo, 359 

Boano, 345 

Boazi, 355 

Bobangi, 313 

Bobo-Fing, 101, 302 

Bobot, 345 

Bodo (BANTU), 313 

Bodo (ТІВЕТО-ВОКМАМ), 
332 

Boghom, 322 

Boholano, 341 

Bohutu, 348 

Boianaki, 348 

Boiken, 36o 

Bokyi, 310 

Bola, 349 

Bolango, 342 

Bolano, 342 

Bole, 321 

Bolgo, 3o6 

Bolinao, 339 

Bolo, 314 

Bom (MADANG- 
ADELBERT RANGE), 357 

Bom (WEST ATLANTIC), 
304 

Bomasa, 307 

Bomwali, 312 

Bonari, 375 

Bondei, 314 

Bonerate, 342 

Bonerif, 358 

Bongili, 313 

Bonglong, 334 

Bongo (CENTRAL 
SUDANIC), 318 

Bongo (OCEANIC), 347 


Bongu, 357 
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Bonkeng, 312 

Bonkiman, 354 

Bontok, Central, 339 

Bontok, East, 339 

Bontok, South, 339 

Bor, (DINKA), 317 

Bor, (LUO), 318 

Bora, 375 

Borai, 359 

Boro, 322 

Bororo, 289, 376 

Borunca, 371 

Bosha, 322 

Bosilewa, 349 

Bosman, 361 

Botlix, 324 

Botocudo, 241, 376 

Botolan, 339 

Bouye, 360 

Bouyei, 149, 337 

Bowili, 308 

Bozo, 302 

Bragat, 359 

Brahui, 138—40, 141, 330 

Brao, 336 

Brat, 359 

Brec, 331 

Breri, 361 

Breton, 30, 33, 37, 42, 59, 
287, 327 

Bribri, 371 

Brü, 335 

Bu, 312 

Bua (ADAMAWA), 306 

Bua (BANTU), 313 

Bubi, 512 

Budibud, 349 

Buduma, 321 

Budux, 324 

Budya, 315 

Buem, 308 

Bughotu, 351 

Bugis, 161, 342 

Buglere, 371 

Buhi, 340 

Buhid, 340 

Buin, 361 

Buja, 313 

Buji, 309 

Buka, 301 

Bukar Sadong, 343 

Bukat, 343 

Bukawac, 348 

Bukidnon, Western, 340 

Bukitan, 343 

Bulalakawnon, 341 
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Bulgarian, 38, бо, 287, 327 

Bulgebi, 354 

Buli (GUR), 305 

Buli (SOUTH 
HALMAHERA), 347 

Bullom, Northern, 304 

Bullom, Southern, 304 

Bulu (BANTU), 312 

Bulu (OCEANIC), 349 

Bum, 312 

Buma, 350 

Bumaji, 310 

Bumbita, 359 

Bun, 360 

Buna, 360 

Bunaba, 362 

Bunabun, 357 

Bunak, 559 

Bunama, 348 

Bunan, 331 

Bungain, 360 

Bungandidj, 365 

Bungku, 342 

Bungu, 314 

Bunun, 338 

Buol, 342 

Bura, 321 

Burarra, 362 

Burgundian, 327 

Buriat, 329 

Burji, 322 

Burmbar, 552 

Burmese, 143, 287, 333 

Buru, 345 

Burua, 372 

Burum, 354 

Burun, 317 

Burunge, 323 

Burushaski, 126, 257, 377 

Burusu, 343 

Busa (MANDE), 303 

Busa (UNCLASSIFIED), 
179, 377 

Busami, 347 

Busang, 343 

Bushong, 313 

Buso, 306 

Bussa, 325 

Butam, 361 

Butmas, 352 

Buton, 342 

Buton, South, 342 

Butu, 309 

Butuanon, 341 

Buyu, 313 

Bwaidoka, 349 


Bwaka, 307 

Bwamu, 305 

Bwe, 331 

Bwela, 313 

Bwile, 315 

Byangsi, 331 
Byelorussian, 38, 287, 327 


Caac, 351 

Cabecar, 371 

Cabishi, 373 

Cabiyari, 374 

Cacua, 372 

Caddo, 232, 367 

Cafundo Creole, 377 

Cagaba, 371 

Cahita, 570 

Cahuarano, 371 

Cahuilla, 370 

Cakchiquel, 204, 288, 369 

Callahuaya, 373 

Camanahua, 376 

Cambodian Khmer, 149, 
287, 336 

Camotes, 341 

Camsa, 240, 372 

Canamari, 572 

Candoshi, 373 

Canela, 377 

Canichana, 239, 372 

Cantonese Yue, 143, 287, 
331 

Capachene, 376 

Capanahua, 376 

Capishana, 239, 372 

Capuibo, 376 

Caquetio, 374 

Caquinte, 374 

Carabayo, 377 

Caraga, 341 

Caraja, 241, 376 

Caramanta, 371 

Caranga, 575 

Carapana, 372 

Carare, 375 

Carib—Galibi, 289, 375 

Caribbean Creole, Western, 
377 

Carijona, 375 

Caripuna, 576 

Carolina, 366 

Carolinian, 351 

Carrier, 365 

Casharari, 374 

Cashibo, 376 

Cashinahua, 376 
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Cashiniti, 374 
Cashuena, 375 
Casiguran Dumagat, 338 
Casu, 314 


Catacao, 371 
Catalan, 37, 59, 287, 326 
Catawba, 232, 235, 367 
Catawishi, 372 
Catiana, 574 
Catio, 371 
Catuquina (MACRO- 
ТОСАМОАМ), 372 
Catuquina (PANOAN), 376 
Cavineña, 376 
Cawishana, 374 
Cayapa, 289, 371 
Cayapo, 377 
Cayuga, 367 
Cayuse, 217f, 368 
Cayuvava, 240, 372 
Cebuano, 161, 164, 288, 341 
Cemuhi, 351 
Ch'iang, 332 
Ch'ien-chiang, 337 
Chacobo, 376 
Chaga, 314 
Chairel, 333 
Chakali, 305 
Chakosi, 308 
Chakpa, 333 
Chakrima, 332 
Chala, 305 
Chali, 335 
Cham, 305 
Cham, East, 344 
Cham, West, 344 
Chamacoco, 373 
Chamalal, 324 
Chamba (ADAMAWA), 306 
Chamba (TIBETO- 
BURMAN), 331 
Chambri, 360 
Chami, 371 
Chamicuro, 240, 373 
Chamorro, 166ff, 338 
Chana, 375 
Chane, 374 
Chang, 332 
Changriwa, 360 
Chapacura, 373 
Chaplino, 329 
Chara, 322 
Charrua, 375 
Chatino, 371 
Chaudangsi, 331 
Chaurasya, 332 
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Chavacano, 377 

Chavante, 289, 377 

Chawai, 309 

Chayahuita, 571 

Chayma, 575 

Che' Wong, 337 

Chechen, 74, 324 

Chehalis, Lower, 366 

Chehalis, Upper, 366 

Chemakum, 366 

Chemehuevi, 370 

Chenapian, 360 

Cheng, 336 

Chepang, 331 

Cherente, 377 

Cherokee, 205, 228, 288, 
367 

Cheyenne, 205, 231, 288, 
366 

Chiapanec, 371 

Chibak, 321 

Chibcha, 371 

Chicao, 375 

Chichimeca, 370 

Chicomuceltec, 368 

Chikena, 375 

Chilcotin, 365 

Chimane, 228, 241, 375 

Chimariko, 207, 211-18 
passim, 234, 369 

Chimbu, 355 

Chimila, 371 

Chimu, 238, 371 

Chinantec, 215, 371 

Chinbok, 333 

Chinbon, 333 

Chinese Mandarin, Yue, 
etc., 143, 331 

Chinese, Archaic, 531 

Chinook Jargon, 207, 214, 
217Í, 235, 288, 378 

Chinook, Lower, 207, 214, 
217f, 235, 288, 367 

Chinook, Upper, 207, 214, 
217f, 235, 288, 367 

Chip, 321 ` 

Chipaya, 225, 230, 373 

Chipewyan, 197, 365 

Chigui, 376 

Chiquitano, 377 

Chiquito (MACRO-GE), 376 

Chiquito (MAIPURAN), 


374 

Chiripong, 308 

Chiru, 332 

Chitimacha, 2o7, 213f, 218, 
236, 368 





Chitkhuli, 331 

Chiwere, 367 

Chocho, 371 

Choctaw, 368 

Choko, 333 

Chokobo, 309 

Chokwe, 315 

Chol, 368 

Cholona, 371 

Cholti, 368 

Chomo, 309 

Chong, 336 

Chontal, 368 

Chopi, 316 

Chorote, 375 

Chorti, 368 

Chrau, 335 

Chru, 344 

Chuabo, 315 

Chuave, 355 

Chuj, 368 

Chukchi, 128, 135-36, 287, 
329 

Chulupi, 375 

Chulym, 328 

Chumana, 376 

Chumash, 207, 211, 217f, 
369 

Churuya, 373 

Chutiya, 332 

Chuvantsy, 328 

Chuvash, 127, 131ff, 287, 
328 

Citak Asmat, 288, 355 

Clallam, 366 

Coahuiltec, 214, 218, 234f, 
369 

Cocama, 373 

Coconuco, 371 

Cocopa, 369 

Coeruna, 375 

Coeur d'Alene, 366 

Cofan, 373 

Colan, 371 

Colorado, 371 

Columbian, 366 

Comanche, 205, 289, 370 

Comecrudo, 214, 217f, 369 

Comox, 366 

Coos, 208, 214, 217, 367 

Coptic, 320 

Сога, 370 

Coraveca, 376 

Coreguaje, 372 

Cornish, 30, 59, 327 

Coroado, 376 

Coropo, 376 
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Cotabato Bilaan, 341 

Cotoname, 214, 218, 369 

Covare, 376 

Cowlitz, 366 

Cozarini, 374 

Cree, 231, 288, 366 

Creek Muskogee, 205, 
288, 368 

Crichana, 375 

Crioulo, 377 

Croatian Serbo-Croatian, 
38, 60, 287, 327 

Crow, 235, 288, 367 

Cua, 336 

Cuaiquer, 371 

Cuati, 374 

Cubeo, 372 

Cucura, 240, 374 ` 

Cuiba, 373 

Cuica, 372 

Cuicatec, 570 

Cuitlatec, 219, 230, 238, 371 

Cujuna, 373 

Culina, 373 

Culli, 239, 371 

Cumana, 373 

Cumanagoto, 375 

Cuna, 208, 238, 289, 371 

Cuniba, 374 

Cupeno, 370 

Curripaco, 374 

Curucane, 376 

Cushichineri, 374 

Custenau, 374 

Cutinana, 374 

Czech, 38, 60, 287, 327 


Da'ang, 334 
Daba, 321 

Dabra, 358 

Dadia, 305 
Dadibi, 357 

Daga, 356 
Dagalag, 363 
Dagara, 305 
Dagbani, 305 
Dagoman, 362 
Dagur, 133, 329 
Dahalo, 323 
Dahating, 354 
Dai, 346 

Daignet, 333 
Daiso, 314 

Daka, 306 
Dakaka, 352 
Dakota, 205, 233, 288, 367 
Dalmatian, 59, 326 
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Dama (ADAMAWA), 306 
Dama (KHOISAN), 301 
Damar, 346 
Dameli, 325 
Dampelasa, 542 
Dan (GUR), 305 
Dan (MANDE), 303 
Danaru, 356 
Danau, 334 
Dangal, 348 
Dangla, 320 
Dani, Grand Valley, 356 
Dani, Western, 356 
Danish, 37, 60, 277f, 287, 
327 
Danisin, 301 
Daonda, 358 
Daraga, 340 
Darambal, 363 
Dargwa, 324 
Darkinyung, 363 
Darmiya, 331 
Dasenech, 323 
Datagnon, 341 
Datooga, 318 
Davaweno, 341 
Dawawa, 348 
Dawera, 346 
Day, 306 
Dayak, Malayic, 344 
Debri, 317 
Dedua, 354 
Defaka, 307 
Degema, 308 
Degenan, 354 
Dehu, 351 
Dek, 306 
Delo, 305 
Dem, 354 
Demta, 356 
Dengebu, 302 
Deni, 373 
Deno, 321 
Dera, 358 
Desano, 372 
Dese, 317 
Deti, 301 
Dewoin, 304 
Dghwede, 321 
Dhalandji, 364 
Dhangu, 362 
Dharawal, 363 
Dharuk, 363 
Dhay’yi, 362 
Dhimal, 331 
Dhuduroa, 362 
Dhurga, 363 


Dhuwal, 362 
Di, 313 
Dibabawon, 340 
Dida, 304 

Dido, 324 
Didra, 336 
Diegueno, 369 
Digaro, 331 
Digo, 314 
Digul, Upper, 355 
Digüt, 373 
Dimasa, 332 
Dime, 322 
Dimir, 357 
Dinik, 317 
Diodio, 349 
Diola, 303 

Diri, 322 
Diriku, 315 
Dirrim, 306 
Disa, 318 

Dixon Reef, 352 
Diyari, 364 
Dizi, 322 
Djamindjung, 362 
Djanti, 312 
Djaru, 364 
Djinang, 362 
Djinba, 362 
Djingili, 188, 362 
Djongkang, 343 
Djuka, 377 
Djwarli, 364 
Dobel, 346 
Dobu, 349 
Doga, 348 
Doghose, 305 
Dogon, 304 
Dogoro, 356 
Dogri, 325 
Dogrib, 365 
Dohoi, 343 
Doka, 310 
Doko, 309 
Dolgan, 328 
Domu, 356 
Domung, 354 
Dondo (BANTU), 314 
Dondo (ТОМІМІ), 342 
Donga, 306 
Dongo, 307 
Dongolawi, 317 
Doni, 317 
Dori'o, 351 
Dorin, 376 
Doriri, 356 
Doromu, 356 
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Dororo, 361 
Dorro, 358 
Dorsha, 322 
Doura, 349 
Doyayo, 278, 306 
Dschang, 311 
Duala, 312 
Duampu, 333 
Duan, 336 
Duano’, 344 
Duau, 348 
Dubea, 351 
Dubu, 358 
Duduela, 357 
Duit, 371 

Duka, 310 

Duke of York, 349 
Duma, 313 
Dumagat, Casiguran, 338 
Dumaget, Umiray, 338 
Dumaki, 325 
Dumbo, 312 
Dumi, 332 
Dumpas, 342 
Dumpu, 356 
Dumun, 356 
Duna, 355 

Dungi, 309 
Dungmali, 332 
Dupa, 306 
Duranmin, 360 
Duriankere, 358 
Duru, 306 
Dusner, 347 
Dusun Deyah, 343 
Dutch, 60, 287, 327 
Dutch Creole, 377 
Duwai, 321 
Dyaabugay, 363 
Dyangadi, 363 
Dye, 305 
Dyeraidy, 362 
Dyimini, 304 
Dyirbal, 363 
Dyirringany, 363 
Dyula, 95, 303 
Dzindza, 314 
Dzorgai, 332 
Dzubucua, 373 


Easter Island Rapanui, 353 
Ebang, 302 

Ebira, 308 

Eblaic, 323 

Ebrie, 308 

Ederah, 355 

Efate, North, 352 
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Efate, South, 352 

Efik, 95, 309 

Efutop, 311 

Eggon, 310 

Egyptian, Ancient, 79, 81, 
83, 286 

Eipo, 358 

Eitiep, 359 

Eivo, 361 

Ejagham, 311 

Ekagi, 356 

Ekajuk, 311 

Ekit, 309 

Ekoka, 301 

Ekpeye, 309 

Elamite, 138, 141, 330 

Eleme, 309 

Elepi, 360 

Elkei, 359 

Elmolo, 325 

Eloyi, 308 

Emae, 353 

Emai, 309 

Emane, 311 

Embera, 289, 371 

Embu, 314 

Emerillon, 373 

Emerum, 357 

Emok, 375 

Emumu, 358 

En, 335 

Ende, 346 

Enets, 328 

Enga, 172, 288, 355 

Engenni, 308 

Enggano, 344 

English, 42, 60, 76, 204, 287, 


327 

Enindhilyagwa, 188, 362 

Enya, 313 

Eotile, 308 

Epie, 308 

Epigraphic South Arabian, 
323 

Erima, 357 

Erokwanas, 346 

Eruwa, 309 

Eryuan, 331 

Esan, 309 

Ese'ejja, 278, 376 

Esimbi, 311 

Eskimo—Inuit, Yupik, 128, 
193ff, 329 

Esmeralda, 373 

Esperanto, 378 

Esselen, 207, 211-18 pas- 
sim, 369 


Estonian, 65, 287, 328 
Eton, 312 

Etruscan, 377 

Etulo, 308 

Evant, 311 

Even, 329 

Evenki, 287, 329 

Ewe, 95, 97, 308 
Ewondo, 312 

Eyak, 195, 197-200, 365 


Ға d'Ambu, 377 

Fa’, 312 

Faai, 375 

Fachara, 310 

Барапі, 351 

Fagululu, 349 

Ғайа, 357 

Faiwol, 355 

Falasha—Kemant, 322 

Fali, 102, 305 

Faliscan, 326 

Falor, 303 

Fanagalo, 377 

Fang, 312 

Fanji, 312 

Fanya, 306 

Faroese, 60, 327 

Fars, 326 

Farsi, 38, 42, 278, 286, 326 

Fas, 359 

Fasu, 355 

Fataluku, 278, 359 

Fe'fe, 311 

Feroge, 306 

Fijian, 165, 288, 352 

Finnish, 33, 65, 287, 328 

Finungwa, 354 

Fipa, 514 

Fitita, 375 

Fiwaga, 355 

Flemish Dutch, 37, 60, 
287, 327 

Flinders Island, 362 

Foau, 358 

Foe, 355 

Foma, 315 

Fomopea, 311 

Fon, 308 

Fongondeng, 311 

Fongoro, 318 

Forak, 354 

Fore, 354 

Fortsenal, 352 

Foto, 311 

Fotuni, 311 

Fox, 231, 366 
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Franco-Provengal, 326 

French, 37, 287, 326 

French Creole, 377 

Frisian, 60, 327 

Friulian, 326 

Fuga, 322 

Fula, 81, 95, 98, 286, 303 

Fulani— Fula, 81, 95, 98, 
286, 303 

Fuliru, 314 

Fulnio, 241, 376 

Fur, 81, 84, 108f, 1114, 317 

Furu, 318 

Futuna, West, 353 

Fuyuge, 356 

Fwai, 351 

Fyam, 310 

Fyer, 321 


Ga, 308 

Ga-dang, 338 

Ga’anda, 320 

Gaam, 317 

Gabadi, 349 

G//abake, 301 

Gabi, 363 

Gabiano, 360 

Gabri, 320 

Gabrielino, 237, 370 

Gabutamon, 354 

Gadaba, 330 

Gadang (AUSTRALIAN), 363 

Gadang (CHADIC), 320 

Gaddang, 338 

Gade, 308 

Gade Lohar, 325 

Gadjerawang, 362 

Gadsup, 354 

Gaelic, Irish—lIrish, 33, 37, 
42, 278, 287, 327 

Gaelic, Scottish, 33, 327 

Gafat, 323 

Gagauz, 328 

Gagudju, 188, 362 

Gahuku, 354 

Gaina, 356 

Gal, 357 

Galambu, 321 

Galela, 359 

Galeya, 349 

Galibi, 289, 375 

Galice, 365 

Galician, 326 

Galke, 306 

Galler, 335 

Galoli, 346 

Galu, 359 
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Gambai, 318 

Gamei, 361 

Gamella, 239, 372 

Gamilaraay, 365 

Gan (CHINESE), 143, 331 

Gan (GUR), 505 

Gana, 342 

G//ana, 301 

Ganade, 301 

Gandole, 306 

Gandua, 311 

Gane, 346 

Gangam, 305 

Ganglau, 356 

Gangulu, 363 

G//ani, 301 

Ganja (EAST NEW 
GUINEA HIGHLANDS), 


355 
Ganja (WEST ATLANTIC), 
303 
Ganjule, 322 
Gants, 354 
Ganza, 322 
Ganzi, 307 
Gao, 350 
Gapapaiwa, 348 
Gapun, 178, 360 
Garadyari, 364 
Garawa, 362 
Garhwali, 38, 325 
Garo (DUSUN-BISAYA), 342 
Garo (ТІВЕТО-ВОКМАМ), 
332 
Garuh, 357 
Garus, 357 
Garuwahi, 348 
Garuwali, 364 
Gasmata, 349 
Gatsame, 322 
Gaulish, 327 
Gaviao, 373 
Gawar, 321 
Gawar-bati, 325 
Gawwada, 322 
Gayardilt, 362 
Gayiri, 363 
Gayo, 344 
Gayon, 371 
Gazi, 326 
Gbanzili, 307 
Gbaya, 306 
Gbi, 307 
Geba, 331 
Gebe, 347 
Gedaged, 347 
Gedeo, 322 


Geez, 323 
Geji, 322 
Gekho, 331 
Gela, 351 
Gelama, 306 
Gelao, 337 
Gele (ADMIRALTY 
ISLANDS), 350 
Gele (CENTRAL 
SUDANIC), 318 
Gende, 354 
Gengle, 306 
Georgian, 24, 71—72, 286, 
324 
Gera, 321 
Gerka, 321 
German, 37, 42, 60, 287, 327 
Geser, 345 
Gesogo, 313 
Ghadames, 320 
Ghale, 331 
Ghanongga, 350 
Ghardaia, 320 
Ghari, 351 
Ghmara, 320 
Ghotuo, 309 
Giay, 337 
Gidar, 321 
Gidiccho, 322 
Gidole, 323 
Gidra, 358 
Gilaki, 326 
Gilbertese, 351 
Gilyak, 126, 259, 377 
Giman, 347 
Gimi, 354 
Ginuman, 356 
Gira, 354 
Girawa, 357 
Giri, 361 
Gisiga, 321 
Gitua, 348 
Giya, 363 
Gizra, 358 
Glavda, 321 
Gnau, 359 
Goajiro, 289, 374 
Godie, 304 
Godoberi, 324 
Goemai, 321 
Gog Nar, 363 
Goggot, 323 
Gogo, 314 
Gogo Mini, 363 
Gogodala, 355 
Gokana, 309 
Gola, 304 
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Gold, 329 

Goliath, 358 

Golo, 307 

Gondi, 138, 140, 330 

Gonja, 308 

Goom, 306 

Gorani, 326 

Gorontalo, 342 

Gorovu, 360 

Gorum, 334 

Gothic, 27, 327 

Gova, 315 

Gowar, 363 

Grebo, 304 

Greek, 27, 33, 37, 4250-53, 
286, 326 

Greek, Classical, 326 

Greenlandic, 193, 329 

Gta', Hill, 334 

Gta', Plains, 334 

Guachi, 375 

Guahibo, 373 

Guaicuru, 375 

Guaja, 373 

Guajajara, 373 

Gualachi, 376 

Guamaca, 371 

Guambiano, 371 

Guamo, 240, 373 

Guana (MACRO- 
РАМОАМ), 375 

Guana (MAIPURAN), 374 

Guanano, 372 

Guanche, 93, 320 

Guanebucan, 374 

Guaque, 375 

Guarani, 204, 289, 373 

Guarategaya, 373 

Guarayu, 373 

Guarequena, 374 

Guariba, 372 

Guarijio, 370 

Guariza, 3'76 

Guato, 241, 376 

Guatuso, 371 

Guaviare, 377 

Guayabero, 373 

Guayaki, 373 

Guaymi, 289, 371 

Guba, 322 

Gubat, 341 

Gubi, 311 

Gubu, 307 

Gudang, 363 

Gude, 321 

Gudjala, 363 

Gudu, 321 


440 


Gudutf, 321 
Gudungura, 363 
Guegue, 377 
Güenoa, 375 
Guere, 304 
Guetar, 371 
Gugadj, 363 
Gugu Badhun, 363 
Gugu Warra, 363 
Gugu Yalandyi, 363 
Guhu-Semane, 356 
Guiarak, 354 
Guinao, 374 
Gujarati, 38, 286, 325 
Gula (ADAMAWA), 306 
Gula (CENTRAL 
SUDANIC), 318 
Guliguli, 361 
Gulla—Sea Island Creole, 
277 
Gumalu, 557 
Gumasi, 349 
Gumbaynggir, 565 
Gunbudj, 188, 362 
Gundi, 307 
Gunggari, 363 
Guniyan, 362 
Gunwinygu, 362 
Gurage, Central West, 323 
Gurage, East, 323 
Gurage, Peripheral West, 
323 
Guragone, 362 
Gurara, 320 
Gure, 309 
Gureng Gureng, 363 
Gurenne, 305 
Gurindji, 364 
Gurma, 305 
Gurmana, 310 
Gurung, 331 
Guruntum, 322 
Gusan, 354 
Gusii, 314 
Gusilay, 303 
Gutob, 334 
Guugu Yimidhirr, 185, 363 
Guwa, 363 
Guwamu, 363 
Guwot, 348 
Guyanese, 377 
Gvoko, 321 
Gwandara, 321 
Gwantu, 310 
Gwari, 308 
Gwedena, 356 


Gweno, 314 

Gwere, 314 

G/wi, 301 

Gyarung, 331 

Gyem, 309 
Gypsy—Romany, 286, 325 


Habu, 346 

Hadiyya, 322 

Hadza, 84, 116-19, 301 

Hahon, 350 

Haida, 195, 197-99, 208-18 
passim, 228, 365 

Hai.n//um, 301 

Haisla, 366 

Haka, 333 

Halang, 336 

Halia, 350 

Halkomelem, 366 

Hallam, 332 

Ham, 347 

Hamar, 322 

Hamtai, 355 

Han, 365 

Handa, 301 

Hanga, 305 

Hangaza, 314 

Hani, 143, 333 

Hanunoo, 340 

Harari, 323 

Hare, 365 

Haroi, 344 

Harso, 322 

Haruku, 346 

Harzani, 326 

Hattam, 359 

Hausa, 79, 81, 86, 321 

Haush, 372 

Havasupai, 369 

Havongan, 343 

Havu, 314 

Hawaiian, 161, 288, 353 

Hawaiian Creole, 377 

Hawthai, 333 

Haya, 314 

Hebrew, 86, 286, 323 

Hebrew, Classical, 323 

Hehe, 314 

Heiltsuk, 366 

Helong, 346 

Hemba, 315 

Herero, 316 

Herisobocono, 373 

Hermit, 350 

Hewa, 360 

Heyo, 359 
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Hianacoto, 375 

Hibito, 371 

Hidatsa, 233, 367 

Higi, 321 

Hiligaynon, 161, 288, 341 

Hina, 321 

Hindi, 38, 42, 286, 325 

Hinihon, 357 

Hinux, 324 

Hiri Motu, 378 

Hishkaryana, 375 

Hittite, 56, 286, 325 

Hitu, 346 

Hiw, 352 

Hka-hku, 333 

Hkauri, 333 

Hmar, 333 

Hmwaveke, 351 

Но, 334 

Hoanya, 338 

Hoava, 350 

Hoking, 331 

Holikachuk, 365 

Holoholo, 313 

Holu, 315 

Hona, 320 

Hopi, 236f, 289, 370 

Horom, 310 

Horpa, 333 

Hote, 348 

НоНепіюі->Мата 
(KHOISAN), 70, 81, 114, 
116, 278, 286, 301 

Hozo, 322 

Hrangkhol, 332 

Hre, 336 

Hruso, 331 

Hsi-lin, 337 

#Hu, 301 

Huachipaeri, 374 

Huambisa, 373 

Huarayo, 376 

Huari, 372 

Huastec, 368 

Huave, 208, 213-20, 228, 
230, 235, 368 

Huichol, 370 

Huihui, 344 

Huilliche, 372 

Huli, 355 

Hulung, 345 

Humene, 356 

Hunde, 314 

Hungana, 314 

Hungarian, 65—66, 287, 328 

Hungu, 314 
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Hunjara, 356 
Hunzib, 74, 324 
Hupa, 365 
Hupda, 372 
Huron, 233, 367 
Hurrian, 55, 377 
Hurza, 321 
Hwethom, 333 


Г wak, 339 

Јааі, 351 

Iatmul, 288, 360 
Iban, 344 

Ibanag, 338 

Ibibio, 309 

Ibie, North, 309 
Ibino, 309 

Ibu, 359 

Ica, 371 

Icelandic, 37, 60, 287, 327 
Icen, 309 

Icheve, 311 

Ida'an, 342 

Idate, 346 

Idi, 358 

Idne, 349 

Idoma, 308 

Idong, 310 

Iduna, 349 

Ifugao, Central, 339 
Ifugao, East, 339 
Igala, 308 

Igana, 361 

Igbo, 95, 97, 286, 309 
Igede, 308 
Ignaciano, 374 
Igom, 361 

Igora, 348 

Iha, 356 

Ik, 318 

Ika, 309 

Ikobi, 357 

Ikpeshi, 309 

Iku, 310 

Ikulu, 310 

Ikundun, 357 
Ikwere, 309 

Ila, 315 

Шапеп, 340 

Illinois, 231, 366 
Ilokano, 161, 288, 338 
Ilongot, 339 
Imihita, 375 
Inanwatan, 358 
Inapari, 374 
Indo-Portuguese, 377 


Indonesian, 160, 164, 344 

Indri, 306 

Ingain, 376 

Ingalik, 278, 365 

Ingrian, 328 

Ingush, 74, 324 

Inibaloi, 339 

Interlingua, 378 

Inuit, Alaskan, 194-95, 329 

Inuit, Canadian, 194-95, 
329 

Ipiko, 357 

Ipili, 355 

Iquito, 371 

Irahutu, 346 

Iranon, 340 

Iranshe, 239, 372 

Iraqw, 323 

Iraya, 339 

Iresim, 347 

Iria, 355 

Ігіра, 340 

Ігіруге, 310 

Irish, 33, 37, 42, 278, 287, 
327 

Irula, 330 

Irumu, 354 

Isabi, 357 

Isala, 305 

Isamal, 341 

Ізап, 354 

Isconahua, 276 

Isebe, 357 

Ishkashmi, 325 

Isinai, 338 

Island Carib, 374 

Isleta, 370 

Isnag, 338 

Isoko, 278, 309 

Istro-Rumanian, 59, 326 

Isu, 312 

Italian, 37, 287, 326 

Itawis, 338 

Itayaine, 374 

Itbayaten, 339 

Itene, 375 

егі, 359 

Itik, 358 

Itneg, 339 

Itonama, 238, 571 

Itutang, 361 

Itza, 368 

Ivatanen, 339 

Ivori, 355 

Iwa, 515 

Iwaidja, 362 


Iwam, 360 
Iwur, 355 
Ixcatec, 371 
Dal, 369 
Iyekhee, 309 
Izaneni, 374 
Izarek, 310 
Izi, 309 


Jaba, 310 

Jacaltec, 369 

Jagoi, 343 

Jah Hut, 337 

Jaku, 150, 311 

Jama Mapun, 344 

Janjero, 322 

Janji, 309 

Jaoi, 374 

Japanese, 127-32, 258, 287, 
329 

Japreria, 375 

Jaqaru, 372 

Jara, 320 

Jarai, 344 

Jarawa (ANDAMAN 
ISLANDS), 354 

Jarawa (BANTU), 311 

Jaun-Jaun, 341 

Javanese, 161, 288, 344 

Jawe, 351 

Jawony, 362 

Jaya Bakti, 344 

Jebero, 371 

Jegu, 320 

Jeh, 336 

Jehai, 336 

Jen, 306 

Jerba, 320 

Jicaque, 208, 215-19 pas- 
sim, 234, 369 

Jidda, 310 

Jili, 333 

Jilim, 357 

Jimajima, 356 

Jimbin, 322 

Jinghpaw, 333 

Jirajara, 371 

Jiru, 309 

Jomang, 302 

Ju, 322 

Ju-chen, 329 

Juang, 334 

Jukun, 309 

Juma, 373 

Jur, 317 

Juray, 334 
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Kala, 335 

Kaantyu, 363 

Kaba, 318 

Kaba Dunjo, 318 

Kabalai, 320 

Kabardian, 72, 324 

Kabishiana, 373 

Kabola, 359 

Kabre, 305 

Kabungsuan, 340 

Kabwari, 314 

Kabyle, 86, 286, 320 

Kadara, 310 

Kadazan-Dusun, 342 

Kadazan, Eastern, 342 

Kadazan, Klias River, 342 

Kadiweu, 375 

Kadu, 333 

Kadugli, 302 

Kaeme, Upper, 355 

Kaeti, 355 

Kafa, 322 

Kafoa, 359 

Kagayanen, 340 

Kagoma, 310 

Kaguru, 314 

Kahayan, 343 

Kahe, 314 

Kahua, 351 

Kaian, 361 

Kaibobo, 346 

Kaidipang, 342 

Kaiep, 347 

Kaike, 331 

Kaili, 342 

Kaingang, 376 

Kairi, 357 

Kairiru, 347 

Kairui (CENTRAL MA- 
LAYO-POLYNESIAN), 
346 

Kairui (TRANS-NEW 
GUINEA), 359 

Kaititj, 364 

Kaivi, 309 

Kaiwa (OCEANIC), 348 

Kaiwa (TUPI), 373 

Kaiwai, 346 

Kajakse, 320 

Kajaman, 343 

Kaje, 310 

Kajoa, 344 

Kakwa, 318 

Kala Lagaw Ya, 188, 362 

Kalabakan, 342 

Kalabari, 307 


Kalabra, 359 
Kalagan, 341 
Kalam, 354 
Kalamansig, 340 
Kalamianen, 340 
Kalanga, 316 
Kalaotoa, 342 
Kalasha, 325 
Kalasha-ala, 325 
Kaleu, 335 
Kali, 306 
Kalibugan, 339 
Kaliko, 318 
Kalinga-Itneg, 339 
Kalinga, Central, 339 
Kalinga, North, 339 
Kalinga, South, 339 
Kalispel, 205, 288, 366 
Kalkatungu, 364 
Kallahan, North, 339 
Kallahan, South, 339 
Kalmyk, 329 
Kalokalo, 349 
Kalou, 360 
Kaluli, 355 
Kam (ADAMAWA), 102, 305 
Kam (KAM-TAL, 149, 288, 
337 
Kamakan, 376 
Kaman, 331 
Kamano, 354 
Kamanton, 310 
Kamara, 305 
Kamarian, 346 
Kamaru, 373 
Kamas, 328 
Kamasa, 355 
Kamasau, 360 
Kamayo, 341 
Kamayura, 373 
Kamba (BANTU), 95, 314 
Kamba (MADANG- 
ADELBERT RANGE), 357 
Kambari, Eastern, 310 
Kambari, Western, 310 
Kambata, 322 
Kambot, 360 
Kamchadal, 135, 329 
Kamhau, 332 
Kami, 314 
Kamkam, 311 
Kamoro, 355 
Kampong Baru, 358 
Kamu, 305 
Kamuku, 310 
Kamula, 355 
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Kana, 309 
Kanakanabu, 338 
Kanakuru, 321 
Kanashi, 331 
Kanauri, 331 
Kanawa, 306 
Kandas, 349 
Kande, 313 
Kanembu, 317 
Kanga, 302 
Kangsiangying, 333 
Kaniet, 349 
Kaningara, 360 
Kaningi, 313 
Kaningkon, 310 
Kanjobal, 369 
Kankanay-Itneg, 339 
Kankanay, Central, 339 
Kankanay, North, 339 
Kankanay, South, 339 
Kannada, 138, 287, 330 
Kanowit, 343 
Kansa, 367 
Kantilan, 341 
Kantu, 335 
Kanufi, 310 
Kanum, 358 
Kanuri, 278, 286, 317 
Kanyok, 315 
Kaonde, 315 
Kapampangan, 161, 339 
Kapin, 348 
Kapingamarangi, 353 
Kapore, 349 
Kapori, 358 
Kapriman, 360 
Kapuas, 343 
Kara (CENTRAL 
SUDANIC), 318 
Kara (OCEANIC), 349 
Karaboro, 304 
Karachay, 328 
Karagas, 328 
Karagwe, 314 
Karaim, 328 
Karakalpak, 328 
Karami, 357 
Karamojong, 318 
Karanga, 316 
Karankawa, 208, 214, 217f, 
369 
Karas, 356 
Karata, 324 
Karaw, 339 
Karawa, 360 
Karawari, 360 


Language Index 


Kare (BANTU), 313 
Kare (MADANG- 
ADELBERT RANGE), 357 
Karekare, 321 
Karelian, 328 
Karfa, 321 
Karipuna, 373 
Karitiana, 373 
Kariya, 322 
Kariyarra, 364 
Karlamay, 364 
Karo Batak, 344 
Karok, 208-18 passim, 234, 
369 
Кагоп, 303 
Karon Dori, 359 
Karon Pantai, 359 
Kasanga, 3o3 
Kasele, 305 
Kasem, 305 
Kasharari, 376 
Kashaya, 369 
Kashmiri, 38, 325 
Kashubian, 327 
Kashuyana, 375 
Kasiguranin, 338 
Kasimbar, 342 
Kaska, 365 
Kasua, 355 
Kasuweri, 358 
Katab, 310 
Katang, 335 
Katbol, 352 
Katcha, 3o2 
Kate, 354 
Katembri, 240, 373 
Kati, 325 
Kati, Northern, 355 
Kati, Southern, 355 
Katiati, 357 
Katingan, 343 
Katinja, 355 
Katio, 371 
Katla, 3o2 
Katu, 335 
Kaugat, 356 
Kaunga, 360 
Kaure, 358 
Kaurna, 364 
Kauwol, 355 
Kavalan, 338 
Kawaiisu, 37o 
Kawang, 344 
Kawarrangg, 363 
Kawatsa, 355 
Kawe, 347 


Kayabi, 373 

Kayah, 331 

Kayan Mahakam, 343 
Kayan, Baram, 343 
Kayan, Kayan River, 343 
Kayan, Mendalam, 343 
Kayan, Murik, 343 
Kayan, Wahau, 343 
Kayeli, 345 

Kaygir, 356 

Kayik, 359 

Kayong, 336 
Kayupulau, 347 
Kazakh, 127, 287, 328 
Kazukuru, 361 
Keapara, 349 

Kebu, 3o8 

Kedang, 346 

Kehelala, 348 

Kei, 346 

Keiga, 302 

Keiyo, 318 

Kejia, 143, 331 

Kekchi, 2o4, 288, 369 
Kela (BANTU), 313 
Kela (ОСЕАМІС), 348 
Kelabit, 342 

Kele, 313 

Kelinyao Kenyah, 343 
Kelo, 317 

Kelon, 359 

Kem Degne, 335 
Kemak, 346 

Kemant, 322 
Kembayan, 343 
Kemtuk, 358 

Kenati, 354 

Kenga, 318 

Keningau Murut, 342 
Kensiu, 337 

Kenta' Bong, 337 
Kenyah, Bahau River, 343 
Kenyah, Bakung, 343 
Kenyah, Kayan River, 343 
Kenyah, Kelinyao, 343 
Kenyah, Mahakam, 343 
Kenyah, Sebob, 343 
Kenyah, Tutoh, 343 
Kenyah, Upper Baram, 343 
Kenyah, Wahau, 343 
Kenyah, Western, 343 
Kenyang, 311 
Kepkiriwat, 373 

Kera, 32o 

Keramin, 364 

Kerebe, 314 
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Kereho-Uheng, 343 

Kerek, 136, 329 

Keres, Eastern, 208, 
214-18, 230, 288, 367 

Кегев, Western, 208, 
214-18, 230, 288, 367 

Kerewo, 558 

Keriaka, 361 

Kermanshahi, 326 

Kesawai, 356 

Ket, 126, 257, 377 

Ketangalan, 338 

Kete, 315 

Ketengban, 358 

Keuro, 358 

Kewa, 355 

Kewieng, 354 

Kezhama, 332 

Kha In, 335 

Kha Koh, 335 

Khakas, 328 

Khalaj, 328 

Khaling, 332 

Khalkha, 287, 329 

Kham, 331 

Khambu, 332 

Khamet, 335 

Khami, 333 

Khamti, 538 

Khandeshi, 325 

Kharia, 334 

Khasi, 149, 151, 155f, 334 

Khaskhong, 335 

Khmer, 149, 151, 155f, 287, 
336 

Khmu', 555 

Khoany, 333 

Khoirao, 332 

Khorchin, 329 

Khowar, 325 

Khua, 535 

Khumi, 333 

Khün, 338 

Khunsari, 326 

Khwarezmian, 325 

Khyang, 333 

Khyeng, 535 

Ki, 312 

Kia (OCEANIC), 350 

Kia (TRANS-NEW 
GUINEA), 355 

Kiangan, 339 

Kiaokio, 333 

Kiballo, 309 

Kibet, 317 

Kikuyu, 95, 314 


444 


Kila, 311 

Kilba, 321 

Kile, 329 

Kilenge, 348 

Kilivila, 349 

Kiliwa, 369 

Kilmeri, 358 

Kim, 102, 305 

Kimaghana, 358 

Kimaragang, 342 

Kimbu, 314 

Kinabatangan, Upper, 342 

Kinabe, 312 

Kinalakna, 354 

Kinamigin, 340 

Kinaray-a, 341 

Kinga, 314 

Kinkwa, 311 

Kinome, 358 

Kinuku, 309 

Kiong, 309 

Kiorr, 335 

Kiowa, 208, 213f, 228, 236, 
289, 370 

Kiowa Apache, 199, 365 

Kipsikiis, 318 

Kiput, 545 

Kiranti, 332 

Kirfi, 321 

Kirghiz, 127, 287, 328 

Kiriri, 373 

Kirma, 305 

Kirmanji, 326 

Kis, 347 

Kisar, 346 

Kisi (BANTU), 514 

Kisi (WEST ATLANTIC), 
304 

Kitanemuk, 370 

Kitimi, 309 

Kitja, 362 

Kitsai, 232, 367 

Kitwii, 311 

Kiwai, Bamu, 358 

Kiwai, Northeastern, 358 

Kiwai, Southern, 358 

Klamath, 208, 211—214, 218, 
368 

Klau, 304 

Klau, Peripheral, 304 

Ko, 502 

Ko-p'u, 333 

Koaia, 239, 372 

Koasati, 368 

Kobiana, 303 

Kobon, 354 


Koch, 332 

Kodagu, 138f, 330 

Kodi, 346 

Koenoem, 321 

Kofa, 311 

Kofyar, 321 

Koguman, 357 

Kohoroxitari, 377 

Kohumono, 309 

Koiari, 356 

Koiari, Mountain, 356 

Koita, 356 

Koke, 306 

Koko Pera, 363 

Kokota, 350 

Kol, 361 

Kola, 346 

Kolakngat, 363 

Kolami, 138—40, 330 

Kolana, 359 

Kolbila, 306 

Kolhreng, 332 

Kolod, 342 

Kolokuma, 307 

Kolom, 356 

Kom (BANTU), 312 

Kom (TIBETO-BURMAN), 332 

Koma, 305 

Komba, 354 

Kombio, 359 

Komering, 344 

Komi, 326 

Kominimung, 361 

Komo (BANTU), 313 

Komo (NILO-SAHARAN), 
319 

Komoro, 314 

Komutu, 354 

Kon Keu, 335 

Konabem, 312 

Konai, 355 

Konda (BANTU), 313 

Konda (DRAVIDIAN), 330 

Konda (TRANS-NEW 
GUINEA), 358 

Koneraw, 355 

Kongo, 95, 286, 314 

Konkani, 325 

Konkomba, 305 

Kono (CARA), 309 

Kono (MANDE), 303 

Konomala, 349 

Konso, 323 

Konua, 361 

Kony, 318 

Konyagi, 303 
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Konyak, 332 

Konzo, 314 

Kopar, 360 

Korafe, 356 

Koraga, 140, 330 

Korak, 357 

Korapun, 358 

Korean, 127-32 passim, 
259, 287, 329 

Koring, 309 

Korku, 334 

Koro (NYIMA), 310 

Koro (OCEANIC), 352 

Korop, 309 

Kortse, 332 

Korwa, 334 

Koryak, 135, 329 

Kosare, 358 

Kosarek, 358 

Koshin, 312 

Koso, 370 

Kosorong, 354 

Kota (BANTU, ZONE B), 


313 
Kota (BANTU, ZONE С), 


313 
Kota (DRAVIDIAN), 138, 

330 
Kota Belud, 344 
Kotogut, 355 
Kotoko, 321 
Kovai, 354 
Kove, 348 
Kowaki, 357 
Koya, 330 
Koyo, 313 
Koyra, 322 
Koyukon, 365 
Kpa, 312 
Kpagua, 307 
Kpan, 309 
Kpati, 311 
Kpelle, 303 
Kposo, 308 
Krachi, 308 
Krahn, 304 
Krangku, 333 
Kravet, 336 
Kreen-Akarore, 377 
Kresh, 112, 318 
Krim, 304 
Krio, 377 
Krisa, 359 
Krobu, 308 
Krongo, 302 
Kru, 97, 304 
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Kru'ng, 336 

Kryts, 324 

Kuba, 313 

Kube, 354 

Kubi, 321 

Kuda, 309 

Kugama, 306 

Kugbo, 309 
Kugu-Muminh, 363 

Kui (DRAVIDIAN), 138—40, 


330 
Kui (TRANS-NEW 
GUINEA), 559 
Kuijau, 342 
Kujarge, 320 
Kuka, 318 
Kukele, 309 
Kukuya, 348 
Kulaal, 306 
Kulango, 304 
Kulele, 304 
Kulere, 321 
Kulung (BANTU), 311 
Kulung (TIBETO- 
ВОКМАМ), 332 
Kumak, 351 
Kumam, 318 
Kumauni, 325 
Kumayena, 375 
Kumba, 306 
Kumokio, 354 
Kumyk, 328 
Kumzari, 326 
Kunama, 81, 83f, 108—12 
passim, 319 
Kunda, 315 
IKung— OKU, 116, 301 
Kungarakany, 188, 362 
Kuni, 349 
Kunimaipa, 356 
Kunyi, 314 
Kunza, 238, 371 
Kupome, 332 
Kupto, 521 
Kurada, 348 
Kurama, 309 
Kurdi, 38, 286, 326 
Kuria, 314 
Kurtjar, 363 
Kuruaya, 373 
Kurudu, 347 
Kurumba, 140, 330 
Kurumfe, 305 
Kurux, 138-40, 330 
Kusaal, 305 
Kusaghe, 350 


Kusaiean, 351 
Kuskokwim, Upper, 365 
Kusu, 315 
Kusunda, 331 
Kutasho, 376 
Kutchin, 365 
Kuteb, 278, 309 
Kutenai, 208, 214-18, 230, 
288, 366 
Kuthant, 365 
Kutin, 306 
Kutu, 314 
Kuturmi, 310 
Kuurn Kopan Noot, 363 
Kuvi, 530 
Kuwaa, 102, 304 
Kuwani, 359 
Kuy, 335 
Kuyunon, 341 
Kuzamani, 309 
Kw'adza, 323 
Kwadi, 301 
Kwaio, 351 
Kwaja, 311 
Kwakiutl-Kwakwala, 288, 
366 
Kwakum, 312 
Kwakwala, 288, 366 
Kwala, 313 
Kwale, 356 
Kwalhiogua, 199, 365 
Kwama, 319 
Kwambi, 316 
Kwamera, 350 
Kwang, 320 
Kwanga, 360 
Kwangali, 515 
Kwangwa, 315 
Kwanka, 310 
Kwanyama, 316 
Kwara'ae, 351 
Kware, 355 
Kwasengen, 360 
Kwatay, 303 
Kwato, 357 
Kwaya, 314 
Kwee, 301 
Kwegu, 317 
Kwei-yang, 337 
Kweni, 303 
Kwerba, 356 
Kwesten, 358 
Kwiri, 312 
Kwoma, 360 
Kwomtari, 359 
Kyao, 532 
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La, 335 
Labo, 352 
Labu, 348 
Labwor, 317 
Lacandon, 368 
Ladin, 326 
Lae, 348 
Laeko, 359 
Laganyan, 347 
Lagawa, 317 
Laghu, 350 
Laha, 337 
Lahanan, 343 
Lahnda, 38, 325 
Lahu, 333 
Laichau, 333 
Lak, 324 
Laka, 334 
Lakalei, 346 
Lakka, 306 
Lakkia, 337 
Lakona, 352 
Lala, 315 
Lalaki, 542 
Lalia, 313 
Lalung, 332 
Lama-Lama, 363 
Lamaholot, 346 
Lamang, 321 
Lamba (BANTU), 315 
Lamba (GUR), 305 
Lambichong, 332 
Lamboya, 346 
Lame, 311 
Lamet, 335 
Lametin, 352 
Lamgang, 332 
Lamma, 359 
Lamnso, 312 
Lamogai, 349 
Lampung, 161, 344 
Lamusong, 349 
Landuma, 304 
Langalanga, 351 
Langam, 360 
Langbasi, 307 
Langet, 333 
Langi, 314 
Lango, 318 
Langrong, 332 
Lanoh, 537 
Lao, 149, 288, 338 
Laotian—Lao, 149, 288, 338 
Lapp—Saami, 33, 65-66, 
278, 287, 328 
Lapuyan, 339 
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Laqua, 337 

Lara”, 343 

Laragiya, 362 

Lardil, 362 

Larevat, 352 

Lari, 326 

Laru (CARA), 310 

Laru (KORDOFANIAN), 
302 

Lashi, 333 

Lati, 337 

Latin, 27, 42, 287, 326 

Latvian, 37, 42, 60, 287, 327 

Lau, 351 

Laua, 356 

Laurentian, 205, 233, 367 

Lave, 336 

Lavongai, 349 

Lavukaleve, 361 

Lawa, 335 

Lawangan, 343 

Layolo, 342 

Laz, 74, 324 

Leco, 239, 371 

Lega, 313 

Legbo, 309 

Lehali, 352 

Lehalurup, 352 

Lehar, 303 

Leipon, 350 

Геке, 331 

Lela, 310 

Lelak, 343 

Lele, 320 

Lembena, 355 

Lemio, 356 

Гетого, 309 

Lemyo, 333 

Lenakel, 350 

Гепса, 213, 215, 219, 238, 
371 

Lengilu, 342 

Lengo, 351 

Lengola, 313 

Lengua, Northern, 375 

Lengua, Southern, 375 

Lenje, 315 

Lenkau, 350 

Lenyima, 309 

Lepcha, 331 

Lesser Antillean Creole, 377 

Letemboi, 352 

Letri Lgona, 346 

Leung, 335 

Levei, 350 

Lewada, 358 

Lewo, 352 


Leyigha, 309 

Lezgi, 324 

Lhanima, 364 

Lho-pa, 331 

Li, 149, 152, 288, 337 

Liang-shan, 333 

Liangmai, 332 

Libo, 306 

Libon, 340 

Libyan, Old—Numidian, 
East, 93, 320 

Ligbi, 303 

Liguri, 317 

Likpe, 308 

Likum, 350 

Lilau, 360 

Lillooet, 366 

Limba (BANTU), 312 

Limba (WEST ATLANTIC), 
101, 304 

Limbu, 332 

Limbum, 311 

Limilngan, 188, 362 

Linda, 307 

Lindrou, 350 

Lindu, 342 

Ling-yun, 337 

Lingarak, 352 

Lingkabau, 342 

Linngithigh, 363 

Lio, 346 

Lisabata, 345 

Lisu, 333 

Lithuanian, 37, 42, 44—45, 
60, 287, 327 

Litzlitz, 352 

Livonian, 328 

Lmam, 336 

Lobala, 313 

Lobi, 305 

Lobu, 342 

Loda, 359 

Logba, 308 

Logbara, 318 

Logo, 318 

Logol, 302 

Logoli, 314 

Logone, 321 

Lohiki, 355 

Lohorong, 332 

Lokele, 313 

Loko (MANDE), 303 

Loko (UPPER CROSS), 309 

Lokukoli, 309 

Lolopho, 333 

Lom, 344 
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Loma, 278, 303 
Lombi, 312 
Lombo, 313 
Lomwe, 315 
Longa, 349 
Longgu, 351 
Longuda, 102, 305 
Loniu, 350 
Lonwolwol, 352 
Looknon, 341 
Lopa, 310 
Lophomi, 332 
Lorediakarkar, 352 
Lorhon, 304 
Losengo, 95, 313 
Lotha, 332 

Lotud, 342 
Lotuxo, 318 

Lou, 359 

Loun, 345 

Loup A, 366 

Loup B, 366 
Lovaea, 359 
Loven, 336 

Lozi, 316 

Lii, 338 

Luangiua, 353 
Luba-Kasai, 95, 315 
Luba-Shaba, 315 
Lubang, 340 
Lubilo, 309 
Luchazi, 315 
Ludic, 328 

Lue, 312 

Luganda, 95, 286, 314 
Luhu, 345 
Luhupa, Southern, 332 
Luimbi, 315 
Luiseno, 237, 370 
Lukep, 348 

Lule, 375 

Lumbu, 313 

Luna, 315 
Lundayeh, 342 
Lundu, 312 
Lung-an, 337 
Lung-chow, 338 
Lungga, 350 
Lungmi, 333 
Lungu,310 

Luo, 286, 317 

Luri, 326 
Lushootseed, 366 
Luwian, 56, 58, 325 
Luxembourgeois, 60, 327 
Luyi, 315 

Luyia, 95, 314 
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Lwena, 315 
Lwo, 317 
Lyase, 310 
Lycian, 58, 325 
Lydian, 56, 325 
Lyele, 305 
Lyoniyorg, 309 


Ma'a, 323 

Ma'anyan, 343 

Maa', 337 

Maasai, 81f, 109, 286, 318 

Maba, 347 

Maban, 108, 317 

Mabang, 317 

Mabenaro, 376 

Mabiha, 315 

Mabo-Barkul, 310 

Mabuiag—Kala Lagaw Ya, 
188, 362 

Maca, 375 

Macaguaje, 372 

Macaguan, 373 

Macanese, 377 

Macedonian, 38, 60, 327 

Machiguenga, 374 

Macu, 372 

Macuna, 372 

Mada (CHADIC), 321 

Mada (NYIMA), 310 

Madak, 349 

Maden, 347 

Madi, 318 

Madik, 359 

Madngele, 362 

Madurese, 161, 344 

Mae, 352 

Maewo, Central, 352 

Mafa, 278, 321 

Mafea, 352 

Magahi, 325 

Magar, 331 

Magi, 356 

Magindanao, 340 

Magori, 349 

Magu, 311 

Mah Meri, 337 

Maha, 321 

Mahakam Kenyah, 343 

Mahas, 317 

Mahican, 366 

Mahigi, 357 

Mahwa, 320 

Maidu, Mountain, 368 

Maidu, Northwest, 368 

Maidu, Southern, 368 

Maii, 352 


Maipure, 374 
Mairasi, 356 

Mairasi, Northeastern, 356 
Maisin, 166f, 347 
Maithili, 325 

Maiwa, 356 

Maiya, 325 

Majang, 317 

Mak, 337 

Makaa, 312 

Makah, 366 

Makasai, 359 
Makassarese, 161, 342 
Makian, East, 346 
Makian, West, 359 
Maklew, 358 
Makonde, 315 
Makua, os, 315 
Makurapi, 373 
Makushi, 375 

Mal, 335 

Malacca Creole, 377 
Malagasy, 16of, 166, 288, 


343 
Malakmalak, 362 
Malamalai, 348 
Malango, 351 
Malas, 357 
Malasanga, 348 
Malaweg, 338 
Malay, 160f, 288, 344 
Malay, Bazaar, 378 
Malayalam, 138, 287, 33o 
Malayic Dayak, 344 
Malayo, 371 
Maldivian, 325 
Male (MADANG- 

ADELBERT RANGE), 357 
Male (OMOTIC), 322 
Maleu, 348 
Malfaxal, 352 
Malgana, 364 
Maligo, 301 
Malila, 315 
Malmariv, 352 
Malo, 352 
Maltese, 323 
Malto, 138—40, 330 
Malua Bay, 352 
Malvi, 325 
Malyangaba, 364 
Mam, 204, 288, 369 
Mama, 311 
Mamaa, 354 
Mamanwa, 341 
Mambai, 346 
Mambila, 311 
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Mamboru, 346 
Mambwe, 315 
Mampelle, 305 
Mamuju, 342 
Mamusi, 349 
Mamvu, 319 

Man Cao-lan, 338 
Managalasi, 356 
Manam, 347 
Manambu, 360 
Manang, 331 

Manao, 374 
Manchati, 331 
Manchu, 1275, 287, 329 
Manda (BANTU), 315 
Manda (DRAVIDIAN), 330 
Mandailing, 344 
Mandan, 233, 367 
Mandar, 342 
Mandara, 321 
Mandarin, 143, 287, 331 
Mandayan, 341 
Mande, 97, 111, 312 
Mander, 358 

Mandi, 360 
Mandinka, 95, 286, 303 
Manegir, 329 
Manem, 558 

Mang, 334 

Manga, 348 

Mangap, 348 
Mangarayi, 188, 362 
Mangareva, 353 
Mangarla, 364 
Mangas, 322 
Mangaya, 306 
Mangbai, 306 
Mangbetu, 319 
Mangbutu, 319 
Mangerr, 189, 362 
Manggarai, 346 
Mangkong, 535 
Mangoli, 345 
Mangseng, 349 
Mangue, 230, 371 
Maninka, 303 
Manipa, 345 
Maniteneri, 374 
Manitsawa, 373 
Manjaku, 303 
Mankan, 303 
Mankon, 311 

Mano, 303 

Manobo, Sarangani, 340 
Mansakan, 341 
Mansi, 65, 328 
Mantion, 359 
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Manusela, 345 

Many, 327 

Manyak, 333 

Manyika, 316 

Manza, 306 

Mao, 332 

Maonan, 337 

Maori, 353 

Mapanai, 574 

Mape, 354 

Mapena, 356 

Mapi, 355 

Mapia, 351 

Mapos, 348 

Mapoyo, 575 

Mapuche—Mapudungu, 
204, 230, 239, 289, 372 

Mapudungu, 204, 230, 239, 
289, 372 

Mara (AUSTRALIAN), 362 

Mara (TIBETO-BURMAN), 
333 

Maragus, 352 

Maralango, 348 

Maram, 332 

Maramba, 360 

Maranao, 340 

Maranungku, 362 

Marathi, 38, 286, 325 

Maratino, 369 

Marau (GEELVINK BAY), 


347 
Marau (MALAITA), 351 
Marawa, 374 
Marba, 320 
Maremgi, 358 
Margany, 363 
Margi, 321 
Margu, 362 
Mari (FINNIC), 69, 328 
Mari (SEPIK-RAMU), 360 
Maria, 356 
Mariate, 374 
Maricopa, 369 
Marimanindji, 362 
Marind, 355 
Marind, Bian, 355 
Marinduque, 340 
Maring (TIBETO- 
BURMAN), 332 
Maring (TRANS-NEW 
GUINEA), 355 
Maringarr, 362 
Maringe, 350 
Marino, 352 
Marithiyel, 362 


Markweta, 318 
Maropa, 376 
Marovo, 350 
Marquesan, 353 
Marshallese, 351 
Martuyhunira, 364 
Maru, 333 
Marubo, 376 
Marwari, 38, 325 
Masa, 320 
Masaba, 314 
Masaca, 372 
Masacara, 376 
Masbate, 341 
Masela, 346 
Mashakali, 241, 376 
Masi, 315 
Masiwang, 345 
Maskelynes, 352 
Masmaje, 320 
Masgan, 323 
Massachusett, 191, 205, 366 
Massenrempulu, 342 
Massep, 179, 377 
Matagalpa, 371 
Matanawi, 371 
Matapi, 374 
Mataua, 373 
Matbat, 347 
Matengo, 315 
Matepi, 357 
Matlatzinca, 370 
Matses, 376 
Mattole, 365 
Matukar, 347 
Matumbi, 315 
Maung, 362 
Maw, 338 
Mawacua, 374 
Mawae, 356 
Mawak, 357 
Mawan, 357 
Mawe, 373 
Mawes, 358 
May, 335 
Maya (AUSTRALIAN), 364 
Maya (SOUTH 
HALMAHERA), 347 
Mayi-Kulan, 363 
Mayi-Kutuna, 363 
Mayo, 360 
Mayogo, 307 
Mayubo, 376 
Mazahua, 204, 370 
Mazanderani, 38, 326 
Mazatec, 204, 370 
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Mba, 307 
Mbabaram, 363 
Mbagani, 315 
Mbai, 318 
Mbala, 315 
Mbaloh, 344 
Mbama, 313 
Mbangala, 314 
Mbangwe, 313 
Mbanja, 307 
Mbanza, 307 
Mbara, 363 
Mbariman-Gudhinma, 363 
Mbati, 313 
Mbato, 308 
Mbe, 311 
Mbede, 313 
Mbembe, 309 
Mbesa, 313 
Mbiywom, 363 
Mboa, 311 
Mboi, 306 
Mboka, 314 
Mboko (BANTU, ZONE А), 


312 
Mboko (BANTU, ZONE C), 
313 
Mbole, 313 
Mbomotaba, 313 
Mbong, 312 
Mboshi, 313 
Mbowe, 315 
Mbugu, 314 
Mbugwe, 314 
Mbula, 311 
Mbulungish, 303 
Mbum, 306 
Mbundu, North, 314 
Mbundu, South, 95, 316 
Mbunga, 315 
Mburku, 322 
Mbuun, 313 
Mbuunda, 315 
Mbwela, 315 
Mbwera, 315 
Mbwisi, 313 
Me'en, 317 
Meakambut, 360 
Meax, 359 
Mebu, 354 
Medebur, 347 
Median, 326 
Medlpa, 355 
Megiar, 547 
Megleno-Rumanian, 59, 326 
Mehek, 360 
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Mehinaku, 374 
Meidob, 317 
Meithei, 332 
Mekaf, 312 
Mekeo, 349 
Mekmek, 360 
Mekongka, 342 
Mekwei, 358 
Melam, 333 
Melamela, 349 
Melanau, 343 
Mele, 353 
Meluri, 332 
Mena, 357 
Mendalam Kayan, 343 
Mende, 95, 303 
Mendriq, 337 
Meneca, 375 
Mengen, 349 
Menghwa, 333 
Menia, 333 
Menien, 376 
Meningo, 359 
Menka, 311 
Menominee, 231, 366 
Mentawai, 344 
Menya, 355 
Meoswar, 347 
Merarit, 317 
Meriam, 188, 358 
Merlav, 352 
Meroitic, 377 
Meru, 314 
Mesalit, 317 
Mesem, 354 
Mesme, 320 
Metu, 333 
Mfinu, 313 
Mianka, 504 
Mianmin, 355 
Miao, 148, 154, 287, 334 
Micmac, 231, 366 
Micronesian Pidgin, 377 
Midhaga, 364 
Midsivindi, 361 
Midu, 331 

Mien, 148, 154, 287, 334 
Migabac, 354 
Migama, 320 
Migili, 310 
Mikarew, 361 
Mikasuki, 368 
Mikeyir, 317 
Mikir, 332 
Millcayac, 571 
Miltu, 320 


Mimi, (MABAN), 81, 317 

Mimi; (MABAN), 81, 317 

Min, Northern, 143, 331 

Min, Southern, 145, 331 

Minangkabau, 161, 344 

Minanibai, 357 

Minbu, 333 

Mindiri, 347 

Mingin, 188, 362 

Mingrelian, 74, 324 

Mini, 309 

Miniafia, 348 

Mintil, 337 

Miranya, 375 

Miri, 302 

Miriti, 372 

Miriwung, 362 

Misim, 348 

Misima, 348 

Miskito, 571 

Misool, 347 

Mituku, 313 

Miwok, Central Sierra, 368 

Miwok, Coast, 368 

Miwok, Lake, 368 

Miwok, Northern Sierra 368 

Miwok, Plains, 368 

Miwok, Southern Sierra, 
368 

Mixe, 368 

Mixtec, 191, 204, 289, 370 

Miya, 322 

Miyak, 360 

Mizo, 333 

Mnong, Central, 536 

Mnong, Eastern, 336 

Mnong, Southern, 336 

Mo, 305 

Moabite, 323 

Moba, 305 

Mocha, 322 

Mocho, 369 

Mocovi, 375 

Mod, 320 

Modang, 343 

Modele, 512 

Modole, 359 

Modra, 336 

Moere, 357 

Moewehafen, 349 

Mofu, 321 

Moghol, 127, 133, 328 


. Mogum, 320 


Mohave, 205, 288, 369 
Mohawk, 205, 288, 367 
Mohegan, 366 
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Moi, 359 

Mok (MON-KHMER), 335 

Mok (OCEANIC), 349 

Moken, 344 

Mokerang, 350 

Mokilese, 351 

Moklen, 344 

Mokulu, 320 

Molale, 217f, 368 

Molbog, 340 

Molima, 349 

Molo, 317 

Molof, 354 

Momare, 354 

Mombum, 355 

Mon (МОМ-КНМЕК), 143, 
151, 287, 336 

Mon (TIBETO-BURMAN), 
332 

Mona, 305 

Monde, 373 

Mondropolon, 350 

Mondunga, 307 

Mongo (BANTU), 313 

Mongo (EAST SUDANIC), 
317 

Mongol, 360 

Mongolian, Classical, 287, 
328 

Mongondow, 342. 

Mongoyo, 376 

Mongsen, 332 

Monguor, 329 

Moni, 356 

Mono (ADAMAWA), 306 

Mono (OCEANIC), 350 

Mono (UTO-AZTECAN), 
370 

Monom, 336 

Мопра, 331 

Monterey, 368 

Montol, 321 

Monumbo, 175, 360 

Monzombo, 307 

Mopan, 368 

Mopwa, 331 

Mor, (TRANS-NEW 
GUINEA), 354 

Mor; (EASTERN MALAYO- 
POLYNESIAN), 347 

Morafa, 354 

Moraid, 359 

Moran, 332 

Moraori, 358 

Morawa, 356 

Mordvin, 69, 328 
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Моге, 95, 305 

Morehead, Lower, 358 

Могеһеаа, Upper, 358 

Moresada, 357 

Mori, 342 

Morigi, 358 

Moriori, 353 

Moriwene, 374 

Moro, 302 

Morokodo, 318 

Morouas, 352 

Moru, 318 

Morunahua, 376 

Morwap, 354 

Mos, 337 

Moseten->Chimane, 228, 
241, 375 

Moshang, 332 

Mosimo, 357 

Mosina, 352 

Moso, 333 

Mota, 352 

Motilon, 238, 371 

Motlav, 352 

Motu, 349 

Movima, 239, 372 

Mozarabic, 326 

Mpalityanh, 363 

Mpi, 333 

Mpiemo, 312 

Mpoto, 315 

Mpotovoro, 352 

Mpuono, 313 

Mrabri, 335 

Mru, 332 

Mubi, 320 

Mudburra, 364 

Mugil, 357 

Muher, 323 

Muinane, 375 

Muk Thang, 362 

Mukawa, 348 

Muktele, 321 

Mukuru, 312 

Mulaha, 356 

Mulao, 337 

Muli, 333 

Mumeng, 348 

Mumuye, 306 

Muna, 342 

Mundang, 306 

Mundani, Lower, 312 

Mundani, Upper, 311 

Mundari, 148, 287, 334 

Mundu, 307 

Mundum, 311 


Munduruku, 373 

Mung, 333 

Munga, 306 

Mungaka, 311 

Munggui, 347 

Muniche, 239, 372 

Munit, 357 

Muniwara, 360 

Munji, 325 

Munkip, 354 

Munsee, 366 

Muóng, 355 

Mura, 371 

Murik, 360 

Murik Kayan, 343 

Muringam, 363 

Murle, 317 

Murrinh-Patha, 188, 362 

Murui, 375 

Murupi, 357 

Murut, Beaufort, 342 

Murut, Keningau, 342 

Murut, Selungai, 342 

Muruwari, 362 

Musak, 357 

Musar, 357 

Musey, 320 

Musgu, 321 

Musian, 459 

Muskogee, 205, 213, 288, 
368 

Musom, 348 

Mussau, 349 

Mutsun, 368 

Mutu, 348 

Mutum, 358 

Mutus, 377 

Muyuw, 349 

Mwa, 303 

Mwanga, 315 

Mwatebu, 348 

Mwera, 315 

Myene, 313 

Mzab, 320 


Na Nhyang, 335 

Nabak, 354 

Nabi, 346 

Nabte, 305 

Nachereng, 332 

Nadéb, 372 

Nadroga, 352 

Nafana, 304 

Naífri, 356 

Naga (MESO-PHILIPPINE), 
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Naga (OMOTIC), 322 

Naga Pidgin, 378 

Nagatman, 179, 377 

Nagovisi, 361 

Nagumi, 311 

Nahali, 155, 377 

Nahu, 354 

Nahuatl, 191, 204, 228, 277, 
289, 370 

Nahuatl, Classical, 370 

Nahukwa, 375 

Naiki, 138, 330 

Naka’ela, 345 

Nakama, 354 

Nake, 357 

Nakkara, 188, 362 

Nalca, 358 

Nali, 350 

Nalik, 349 

Nalu, 303 

Nama (JUKUNOID), 309 

Nama (KHOISAN), 79, 81, 
114, 116, 278, 286, 301 

Namakura, 352 

N/amani, 301 

Nambas, Big, 352 

Nambi, 359 

Nambikuara, Northern, 
220, 230, 288, 372 

Nambikuara, Southern, 
220, 230, 288, 372 

Nambu, 358 

Nameji, 333 

Namie, 360 

Namumi, 355 

Nancere, 320 

Nande, 314 

Nandi, 82, 109, 318 

Nandu, 310 

Nangalami, 325 

Nanggu, 361 

Nankina, 354 

Nanticoke, 366 

Nao, 322 

Napu, 342 

Nara, 349 

Narak, 355 

Narango, 352 

Narau, 358 

Nari Nari, 363 

Narom, 343 

Naron, 301 

Narragansett, 366 

Nasarian, 352 

Nasioi, 361 

Nata, 314 
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Natanzi, 326 
Natchez, 208-14 passim, 
218, 235, 368 
Natioro, 304 
Natu, 239, 372 
Naudem, 305 
Naueti, 346 
Naukan, 329 
Nauna, 350 
Nauruan, 351 
Navajo, 197, 204, 288, 365 
Navut, 352 
Nawuri, 308 
Naxi, 333 
Nayini, 326 
Ncha, 311 
Nchanti, 312 
Nchumburu, 308 
Nchumunu, 308 
Ndaktup, 311 
Ndali, 315 
Ndam, 320 
Ndamba, 314 
Ndao, 346 
Ndasa, 313 
Ndau (BANTU), 316 
Ndau (CELEBES), 342 
Nde, 311 
Ndebele, 316 
Ndembo, 315 
Ndengereko, 315 
Ndengese, 313 
Nding, 302 
Ndingi, 314 
Ndjébbana, 188, 362 
Ndo, 319 
Ndoe, 311 
Ndogo, 306 
Ndom, 358 
Ndombe, 316 
Ndonde, 315 
Ndonga, 316 
Ndoro, 311 
Nduga, 356 
Nduindui, 352 
Nduke, 350 
Ndumu, 313 
Ndut, 303 
Nedebang, 359 
Nee, 333 
Nefusi, 320 
Negidal, 329 
Nehan, 350 
Nek, 354 
Nekgini, 354 
Neko, 354 


Neku, 351 

Nembao, 350 

Nembe, 307 

Nemi, 351 

Nenets, 328 

Nengaya, 348 

Nengone, 351 

Nepali, 38, 325 

Nera, 112, 278, 317 

Nere, 351 

Nete, 355 

Newari, 331 

Nez Perce, 368 

Ng men, 333 

Ng oma, 318 

Ngada, 346 

Ngadjunmaya, 189, 364 

Ngaing, 354 

Ngala (SEPIK-RAMU), 360 

Ngala (ТІВЕТО-ВОКМАМ), 
333 

Ngalakan, 362 

Ngalik, North, 356 

Ngalik, South, 356 

Ngaliwuru, 362 

Ngalkbon, 362 

Ngalum, 355 

Ngamini, 364 

Ngamo, 321 

Nganasan, 328 

Ngandi, 362 

Ngando (MONGO), 313 

Ngando (NGANDO), 313 

Ngandyera, 316 

Ngangea, 318 

Ngangela, 315 

Nganyaywana, 363 

Ngardi, 364 

Ngare, 313 

Ngariawan, 348 

Ngarigu, 363 

Ngarinman, 364 

Ngarinyeri, 364 

Ngarla, 364 

Ngarndji, 362 

Ngayawung, 364 

Ngbaka, 306 

Ngbandi, 306 

Ngbundu, 307 

HINg.Ve, 301 

Ngenkikurrunggur, 362 

Ngeq, 335 

Ngie, 312 

Ngile, 302 

Ngindo, 315 

Ngiri, 313 
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Ngiyambaa, 363 

Ngizim, 321 

Ngkoth, 363 

Ngombe (BANTU), 513 
Ngombe (UBANGIAN), 307 
Ngoni, 316 

Ngu Ngwoni, 313 

Ngubu, 307 

Ngul, 313 

Ngulu (BANTU, ZONE G), 


314 
Ngulu (BANTU, ZONE P), 
315 
Ngumba, 312 
Ngundi, 313 
Ngundu, 307 
Ngurimi, 314 
Ngwe, 311 
Ngwele, 314 
Ngwii, 313 
Ngwo, 311 
Ngwoi, 310 
N/hain.tse, 301 
Nhanda, 364 
Nhang, 337 
Nhengatu, 373 
N/huki, 301 
Nhuwala, 364 
Niarawa, 376 
Nias, 344 
Nicobarese, 337 
Nicola, 365 
Nielim, 306 
Nii, 355 
Nimbari, 102, 305 
Nimboran, 358 
Nimi, 554 
Nimo, 359 
Nimowa, 348 
Ninam, 371 
Nindem, 310 
Ninggera, 358 
Ninggirum, 355 
Ningil, 559 
Ninzam, 310 
Nipsan, 358 
Nishi, 331 
Nitinat, 366 
Niuafo'ou, 353 
Niuean, 352 
Njebi, 313 
Njem (BANTU, ZONE A), 
12 
Njem (GRASSFIELDS 
BANTU), 312 
Nkangala, 315 
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Nkem, 311 

Nkonya, 308 

Nkoya, 315 

Nkriang, 335 

Nkumbi, 316 

Nkumm, 311 

Nkutu, 313 

Nkwen, 311 

Nnam, 311 

Nocte, 332 

Nocten, 375 

Nogai, 328 

Noho, 312 

Nokopo, 354 

Nokuku, 352 

Nomad, 355 

Nomane, 355 

Nomatsiguenga, 374 

Nomlaki, 368 

Nomu, 354 

Non, 303 

Noni, 312 

Nonuya, 375 

Nooksack, 366 

Nootka, 232, 288, 366 

Norwegian, 37, 60, 277f, 
287, 327 

Notre, 305 

Notsi, 349 

Nottoway, 233, 367 

Notu, 356 

Nruanghmei, 332 

Ntem, 311 

Ntenyi, 532 

Ntomba, 313 

Nuaulu, 345 

Nuer, 318 

Nuk, 354 

Nukuoro, 353 

Numanggang, 354 

Nume, 352 

Numee, 351 

Numidian, East, 95, 320 

Numu, 303 

Nung (TAI), 338 

Nung (TIBETO-BURMAN), 
333 

Nungali, 362 

Nunggubuyu, 188, 562 

Nungu, 310 

Nuni, 305 

Nuoku, 335 

Nupe, 97, 308 

Nusalaut, 346 

Nutabe, 371 

Nwa, 303 


Nyabwa, 304 
Nyah Kur, 336 
Nyahon, 336 
Nyakyusa, 315 
Nyala (BANTU), 314 
Nyala (EAST SUDANIC), 
317 
Nyalgulgule, 317 
Nyali, 313 
Nyamal, 364 
Nyamwezi, 314 
Nyaneka, 316 
Nyanga, 313 
Nyangbo, 308 
Nyangumarta, 364 
Nyanja, 95, 315 
Nyawaygi, 363 
Nyende, 305 
Nyengo, 315 
Nyi, 333 
Nyigina, 362 
Nyiha, 315 
Nyika, 314. 
Nyilamba, 314 
Nyimang, 317 
Nyiyabali, 364 
Nyo’o, 312 
Nyole, 314 
Nyoro, 314 
Nyuli, 314 
Nyuinyul, 362 
Nyungar, 189, 364 
Nyungwe, 315 
Nzakara, 307 
Nzangi, 321 
Nzema, 308 


!Ó, 301 

Oas, 340 
Obang, 312 
Obanliku, 310 
Obo, 340 

Obolo, 309 
Obulom, 309 
Ocaina, 375 
Ocorono, 373 
Ocuiltec, 370 
Odionganon, 341 
Odual, 309 

Ofo, 233, 367 
Ogbah, 309 
Ogbia, 309 
Ogbogolo, 309 
Ogbronuagum, 309 
Ohlone, 368 

Oi, 336 
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Oirat, 329 

Оігаќа, 359 
Ojibwa, 231, 288, 366 
Okanagan, 366 
Okiek, 318 
Oko-Juwoi, 354 
Okobo, 309 
Okomanjan, 312 
Okpamheri, 309 
Окре; (EDO), 309 
Okpe, (EDO), 309 
Oksapmin, 354 
Oku, 312 

Olcha, 329 

Old Church Slavonic, 327 
Olo, 359 

Olonets, 328 
Olulumo, 309 
Oluta, 368 
Omagua, 373 
Omaha, 205, 367 
Omati, 357 
Ombo, 313 
Ometo, 322 
Omie, 356 

Omok, 328 
Omotik, 119, 318 
Omurano, 371 
Onabasulu, 355 
One, 359 
Oneida, 367 
Ong, 335 
Ongamo, 318 
Onge, 354 

Onin, 346 

Onjob, 356 

Ono, 354 
Onondaga, 233, 367 
Opao, 358 

Opata, 370 
Opaye, 241, 376 
Opo, 319 

Opone, 375 

{Ora, 301 

Orang Hulu, 344 
Orang Kanaq, 344 
Orang Laut, 344 
Orang Seletar, 344 
Ordos, 329 
Orejon, 372 
Orejone, 375 
Oriya, 38, 325 
Ormu, 347 
Ormuri, 326 
Oroch, 329 
Orochon, 329 
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Oroha, 351 

Orok, 329 
Orokaiva, 356 
Orokolo, 358 
Oromo, 86, 278, 286, 323 
Oron, 309 

Orowe, 351 
Osage, 367 
Oscan, 326 
Oshie, 312 
Ososo, 309 
Ossetic, 325 
Osum, 357 
Otank, 311 

Oti, 241, 376 
Otomaco, 24o, 373 
Otomi, 204, 289, 370 
Otuke, 376 
Ouma, 349 
Оугепа, 354 
Owiniga, 359 
Oyampi, 373 
Oyda, 322 
Oykangand, 363 


P'ou-la, 333 

P'uman, (ANGKUIC), 335 
P'uman; (BULANG), 335 
Pa, 355 

Pa-o, 331 

Pa'a, 322 

Pa'umotu, 353 

Paama, 352 

Paang, 533 

Pacahuara, 376 

Pacóh, 335 

Pacu, 374 

Padang, 318 

Padaung, 531 

Paduko, 321 

Paez, 289, 371 

Pagi, 358 

Pagu, 359 

Pahari, Western, 38, 325 
Pahi, 360 

Pahri, 331 

Pai (NYIMA), 310 

Pai (SEPIK-RAMU), 360 
Pai-lang, 333 

Paici, 351 

Paipai, 369 

Paiute, Northern, 370 
Paiute, Southern, 370 
Paiwan, 338 

Pajonal, 374 

Pak, 350 


Paka, 360 
Pakaasnovos, 373 
Pakanha, 363 
Pakatan, 335 
Pakishan, 333 
Pakot, 318 

Pakü, 331 

Palaic, 56, 325 
Palaka, 304 
Palamul, 347 
Palauan, 166ff, 338 
Palawefo, 340 
Pale, 334 
Palenquero, 377 
Palicur, 374 
Pallangahmiddang, 362 
Palmella, 375 
Palu'e, 346 
Paluan, 342 

Pam, 306 

Pama, 375 
Pambia, 307 
Pame, 370 
Pamigua, 373 
Pamona, 342 
Pana, 305 
Panaras, 361 
Panare, 375 
Pandan, (BIKOL), 340 
Pandan, (KINARAYAN), 341 
Pande, 313 
Pangasinan, 161, 339 
Pangutaran, 343 
Pangwa, 314 
Panim, 357 
Pankararu, 239, 372 
Pankhu, 333 
Pao-an, 329 
Papapana, 350 
Papar, 342 
Papasena, 358 
Papel, 303 

Papi, 360 
Papiamento, 377 
Papitalai, 350 
Papora, 338 
Papuma, 347 
Parachi, 326 
Parakaná, 373 
Paranan, 338 
Paranawat, 373 
Paraujano, 374 
Paravilhana, 375 
Parawa, 372 
Parawen, 357 
Pareca, 375 
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Paressi, 374 

Рагігі, 375 

Parji, 139-40, 330 

Parquenahua, 376 

Parthian, 326 

Parukoto, 375 

Parya, 325 

Pashai, 325 

Pashto, 38, 286, 325 

Pasi, 360 

Pasismanua, 349 

Passam, 306 

Passamaquoddy, 231, 366 

Passe, 374 

Patamona, 375 

Patani, 347 

Patpatar, 349 

Patwin, 368 

Paulohi, 346 

Paumari, 373 

Pauserna, 373 

Paushi, 375 

Paushiana, 375 

Pauwi, 179, 377 

Pawaian, 357 

Pawnee, 232, 288, 367 

Pay, 357 

Paya, 215, 219, 238, 371 

Paynamar, 357 

Pazeh, 338 

Pear, 336 

Peba, 374 

Pemba, 314 

Pemon, 375 

Penchal, 350 

Pengo, 330 

Pentlatch, 366 

Pero, 321 

Persian—Farsi, 38, 42, 278, 
286, 326 

Persian, Old, 27, 326 

Petapa, 342 

Petats, 350 

Phai, 335 

Phalok, 335 

Phalura, 325 

Phana, 333 

Phang, 335 

Phayeng, 333 

Phlon, 331 

Pho, 331 

Phoenician, 323 

Phom, 332 

Phray, 335 

Phuan, 338 

Phun, 333 
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Phunoi, 333 
Phuong, 335 
Phupha, 333 
Piamatsina, 352 
Piame, 360 
Pianakoto, 375 
Piapoco, 374 
Piaroa, 372 
Pichobo, 376 
Picuris, 370 

Pije, 351 

Pila, 357 

Pilaga, 375 

Pileni, 353 

Pima, 370 

Pima Bajo, 370 
Pimbwe, 315 
Pimenteira, 375 
Pinai, 360 
Pingfang, 332 
Pinji, 313 

Pinyin, 311 
Pipikoro, 342 
Pipil, 370 
Piratapuyo, 372 
Piri, 313 
Ріго(АКАУУАКАМ), 374 
Piro (ТАМОАМ), 236, 370 
Pisa, 355 

Pisabo, 376 
Pitcairnese, 377 
Piti, 309 
Pitta-Pitta, 364 
Pitu Ulunna Salo, 342 
Piu, 348 

Piva, 35o 

Plang, 335 
Playero, 373 
Po-se, 337 
Pochuri, 332 
Pochutla, 237, 37O 
Pocomam, 369 
Pocomchi, 369 
Podopa, 357 
Podzo, 315 
Pogaya, 355 
Pogoro, 314 

Pok, 318 
Pokomo, 314 

Pol, 312 

Polabian, 327 
Polchi, 322 
Polish, 38, 42, 6o, 287, 327 
Polonombauk, 352 
Pom, 347 

Pomo, Central, 211, 234, 


369 


Pomo, Eastern, 211, 234, 


3 

Pomo, Northeastern, 211, 
234, 369 

Pomo, Northern, 211, 234, 
369 

Pomo, Southeastern, 211, 
234, 369 

Pomo, Southern, 211, 234, 
369 

Ponam, 350 

Ponapean, 351 

Pondoma, 357 

Pongo, 312 

Pongu, 310 

Ponosakan, 342 

Poong, 335 

Popayan, 371 

Popoloc, 371 

Popoluca, Sierra, 368 

Porome, 179, 377 

Port Sandwich, 352 

Port Vato, 352 

Portuguese, 37, 204, 287, 
326 

Poso, 344 

Potawatomi, 231, 366 

Potiguara, 373 

Pou Ma, 335 

Powhatan, 366 

Poyanawa, 376 

Prouac, 336 

Provençal, 37, 287, 326 

Prussian, Old, 327 

Puari, 359 

Pucapacuri, 374 

Puelche, 239, 373 

Puinave, 289, 372 

Pukapuka, 353 

Puku, 310 

Pulabu, 356 

Pulie, 349 

Punan Aput, 343 

Punan Bah, 343 

Punan Merah, 343 

Punan Merap, 343 

Punan Tubu, 343 

Punan-Nibong, 343 

Puncuri, 374 

Punjabi, 38, 286, 325 

Punu (BANTU), 313 

Punu (МАО-ҮАО), 334 

Puragi, 358 

Purari, 358 

Puri, 376 

Purubora, 373 

Purum, 332 


Language Index 


Putai, 321 
Putatan, 344 
Putoh, 342 
Puyuma, 338 
Pwaamei, 351 
Pwapwa, 351 
Pwo, 305 
Pyapun, 321 
Pyen, 333 
Pyu, 359 


Qashqay, 328 

Qawasqar, 278, 372 

Quaca, 375 

Quapaw, 367 

Quechua A, 191, 204, 208, 
289, 372 

Quechua B, 191, 204, 208, 
289, 372 

Qui-chau, 537 

Quiche, 204, 288, 369 

Quileute, 288, 360 

Quinault, 366 

Quinigua, 369 

Quiripi, 366 

Quitemoca, 375 

Оха, 116, зот 


Кара, 352 

Raglai, Cacgia, 344 
Raglai, North, 344 
Raglai, South, 344 
Rai, 332 

Rajah, 340 

Raji, 331 

Ralte, 332 

Rama, 371 
Ramarama, 373 
Rampi, 342 
Rangkas, 331 
Rangloi, 331 

Rao, 360 

Карапші, 353 
Rapting, 357 
Rarotongan, 353 
Ratahan, 342 
Raute, 331 

Rauto, 349 

Rawa, 354 
Rawang, 333 
Rawo, 359 

Rejang, 161, 344 
Rejang Kayan, 343 
Rek, 318 
Rembarrnga, 362 
Remo (MUNDA), 334 
Remo (РАМОАМ), 376 
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Rémoxbo, 376 
Rempi, 357 
Rendille, 323 
Rengao, 336 
Rengma, 332 
Rennellese, 353 
Repanbitip, 352 
Rerau, 357 

Кеге, 302 

Кегер, 352 

Reshe, 310 
Resigaro, 374 
Reyesano, 376 
Rhade, 344 

Riang, 334 
Riantana, 358 
Ribun, 343 

Riff, 86, 286, 520 
Rikbaktsa, 241, 376 
Rimi, 314 

Кігіо, 350 
Ritharngu, 362 
Riung, 346 

Roba, 306 

Rocky Peak, 359 
Rodong, 332 
Roinji, 348 

Roma, 346 
Romansch, 326 
Romany, 286, 325 
Romblonanon, 341 
Romkun, 361 

Ron (CHADIC), 321 
Ron (GEELVINK BAY), 347 
Ronga, 95, 316 
Ronge, 317 

Roper River Creole, 377 
Roria, 352 

Roro, 349 

Roti (SAMA BAJAW), 344 
Roti (TIMOR), 346 
Rotokas, 361 
Rotuman, 165, 352 
Roucouyene, 375 
Roviana, 350 

Rue, 315 

Ruguru, 314 
Ruihi, 315 

Rukai, 338 
Rukuba, 310 
Rumai, 334 
Rumanian, 37, 59, 287, 326 
Rumaya, 309 
Rumsen, 368 
Runa, 571 

Rundi, 95, 314 
Runga, 517 


Rungchenbung, 332 
Rungu, 315 

Rungus, 342 

Rungwa, 315 

Runic, 327 

Ruruma, 309 

Rusa, 314 

Russian, 38, 42, 287, 327 
Russo-Norsk, 378 
Rutul, 324 

Ruund, 315 

Rwanda, 95, 286, 314 
Rwo, 314 

Ryukyuan, 132, 287, 329 


Sa, 552 

Sa'a, 351 

Sa'ban, 342 

Sa'dan, 342 

Sa'och, 336 

Saami, Eastern, 35, 65—66, 
278, 287, 328 

Saami, Northern, 35, 
65—66, 278, 287, 328 

Saami, Southern, 33, 
65—66, 278, 287, 328 

Saaroa, 338 

Saba, 320 

Sabanes, 372 

Sabellian, 278, 326 

Saberi, 356 

Sabeu, 333 

Sabum, 337 

Sacapultec, 369 

Sadanga, 339 

Sae, 374 

Saek, 337 

Saep, 356 

Safalaba, 305 

Safan, 303 

Safeyoka, 355 

Safwa, 315 

Sagala, 314 

Sahaptin, 208, 214, 218, 368 

Saho, 322 

Sahu, 359 

Saija, 371 

Saisiyat, 338 

Sajau Basap, 345 

Sak, 333 

Saka, 325 

Sakam, 354 

Sakao, 352 

Sakata, 313 

Saki, 357 

Salampasu, 315 

Salar, 328 
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Salawati, 347 

Saleman, 345 

Saliba, 372. 

Salinan, 208, 214, 218, 369 

Saluan, 342 

Salug, 339 

Saluma, 375 

Sama, 514 

Sama, Central, 343 

Sama, North, 343 

Sama, South, 343 

Samagir, 329 

Samar, North, 161, 341 

Samarokena, 356 

Samoan, 161, 288, 353 

Samosa, 357 

Samre, 336 

San, 303 

San Francisco, 368 

San Ignacio, 373 

San Jose, 368 

Sanapana, 375 2 

Sandawe, 116— 19, 84, 117, 
301 

Sandia, 570 

Sane, 303 

Sanga, 315 

Sanggau, 345 

Sangil, 342 

Sangir, 342 

Sangisari, 326 

Sangke, 359 

Sanglechi, 325 

Sango (ВАМТО), 314 

Sango (UBANGIAN), 95, 
306 

Sangu, 313 

Sani, 333 

Sanio, 360 

Sanjo, 315 

Sanskrit, 27, 39-40, 47, 50- 
53, 325 

Santa, 329 

Santa Cruz (EAST РА- 
PUAN), 361 

Santa Cruz (PENUTIAN), 
368 

Santali, 148, 287, 334 

Santiam, 367 

Sanuma, 371 

Sapara, 375 

Saparua, 346 

Sape, 372 

Sapiboca, 376 

Sapiny, 318 

Saposa, 350 

Sapuan, 336 
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Sapuya, 373 

бага, 318 
Saramaccan, 377 
Sarangani Bilaan, 341 
Sarangani Manobo, 340 
oarare, 372 
Saraveca, 374 
Sarcee, 199, 365 
Sardinian, 37, 59, 326 
Sari, 306 

Saruga, 357 

Sarwa, 320 

Sasak, 161, 344 
Sasaru, 309 

Sasawa, 356 

Sau, 355 

Sauk, 354 

Sause, 358 

Sausi, 356 

Savara, 330 
Savosavo, 361 

Savu, 346 

Sawai, 347 

Sawos, 360 

Sawuy, 355 

Sayula, 368 

Scottish Gaelic, 33, 327 
Scythian, 325 

Sea Island Creole, 377 
Seba, 315 

Sebob Kenyah, 343 
Sec, 239, 371 
бесоуа, 372 
Sedang, 336 

Sedeg, 338 

Sedoa, 342 

берді, 343 

Seget, 359 

Seimat, 349 

Sekani, 365 

Sekar, 346 

Seke, 352 

Seko, 342 

Sekyani, 313 

Selaru, 346 

Sele, 308 

Selepet, 354 
Selknam, 372 
Selkup, 328 
Selungai Murut, 342 
Selwasa, 346 

Semai, 337 

Semaq Вегі, 337 
Sembakung, 342 
Seme, 102, 304 
Semelai, 337 


Semimi, 356 

Semnam, 337 

Semnani, 326 

Sempan, 355 

Sena, 315 

Senari, 304 

Sene, 354 

Seneca, 367 

Sened, 320 

Senga, 315 

Sengele, 313 

Senggi, 358 

Sengmai, 333 

Sensi, 376 

Sentani, 356 

Sentinel, 354 

Sepa, 347 

Sepen, 361 

Sera, 347 

Serbo-Croatian, 38, 60, 287, 
327 

Sere, 306 

Serer, зо» 

Seri, 209, 213f, 218f, 234, 
369 

Serrano, 370 

Serua, 346 

Serudung, 342 

Serui-Laut, 347 

Seshelt, 366 

Seta, 359 

Setaman, 355 

бен (SERAM), 345 

бен (TORRICELLI), 359 

Sewa Bay, 349 

Sewacu, 373 

Sewe, 306 

Sezo, 322 

Sgaw, 331 

Sha, 321 

Shagawu, 321 

Shahuanahua, 376 

Shako, 322 

Shall, 310 

Shambala, 314 

Shan, 149, 288, 338 

Shangge, 332 

Shani, 309 

Shant'ou, Yunnan, 338 

Sharanahua, 376 

Shark Bay, 352 

Shasta, 208, 210f, 217f, 234, 
369 

Shatt, 317 

Shawiya, 320 

Shawnee, 231, 366 


Language Index 


She, 322 

Shebayo, 374 

Sheliff Basin, 320 

Sheta, 373 

Shi, 314 

Shilha, 86, 93, 286, 320 

Shilluk, 109, 317 

Shina, 325 

Shipaya, 373 

Shipibo, 376 

Shipinawa, 376 

Shirumba, 302 

Sho, 333 

Shompen, 337 

Shona, 95, 286, 316 

Shonshe, 333 

Shor, 328 

Shoshone, 236, 237, 370 

Shua, 301 

Shuar, 373 

Shubi, 314 

Shughni, 326 

Shukuru, 239, 372 

Shumashti, 325 

Shuswap, 288, 366 

Shwe, 334 

Siagha, 355 

Sialum, 354 

Siamese—Thai, 149, 288, 
338 

Siane, 354 

Siang, 345 

Siar, 349 

Siboma, 348 

Sibutuq, 345 

Sidamo, 322 

Sie, 350 

Sierra Popoluca, 368 

Sighu, 313 

Sihan, 357 

Sihia, 333 

Sikaiana, 353 

Sikka, 346 

Sila, 317 

Silakau, 343 

Sileibi, 357 

Siliput, 359 

Silisili, 348 

Silopi, 357 

Simaa, 315 

Simalungan, 344 

Simbari, 355 

Simbo, 350 

Simeulue, 344 

Simog, 358 

Sinabo, 376 
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Sinagoro, 349 
Sinauna, 339 
Sindangan, 339 
Sindhi, 38, 325 
Singgie, 343 

Sinhalese, 38, 325 
Sinji, 314 

Sinsauru, 356 

Sinsiga, 371 

Sinyar, 318 

Sio, 348 

Siocon Subanon, 339 
Siona, 372 

Sipacapa, 369 

Sipo, 373 

Sira, 313 

Sirak, 348 

Sirasira, 348 

Siraya, 338 

Sirenik, 329 

Siri, 322 

Siriano, 372 

Sirimeri, 374 

Sirio, 356 

Siriono, 373 

Sirkai, 358 

Siroi, 356 

Sisi, 350 

Sissano, 347 

Siti, 305 

Siuslaw, 214, 218, 367 
Sivandi, 326 

Siwa, 93, 320 

Siwai, 361 

Siwu, 308 

Siyin, 332 

Sko, 359 

Slave, 365 

Slavonic, Old Church, 327 
Slovak, 38, 60, 287, 327 
Slovene, 38, 60, 327 
So (BANTU, ZONE A), 312 
So (BANTU, ZONE С), 313 
So (EAST SUDANIC), 318 
So (MON-KHMER), 335 
Sobei, 347 

Soddo, 323 

Soga, 314 

Sogdian, 325 

Soi, 326 

Sok, 336 

Sokna, 93, 320 

Sokoro, 320 

Soledad, 368 

Soli, 315 

Solomon Pijin, 377 


Solon, 329 

Solos, 350 

Som, 354 

Somahai, 355 

Somali, 86, 286, 323 

Some, 355 

Somrai, 320 

Somray, 336 

Son, 335 

Sona, 356 

Songhai, 81, 98, 1086, 111, 
286, 317 

Songo, 314 

Songomeno, 313 

Songum, 357 

Songye, 315 

Sonia, 355 

Soninke, 302 

Sonjo, 314 

Sora, 334 

Sorbian, Lower, 327 

Sorbian, Upper, 327 

Sori, 350 

Soruba, 305 

Sotho, Northern, 95, 316 

Sotho, Southern, 95, 316 

Southwest Bay, 352 

Sowa, 352 

Spanish, 37, 204, 287, 326 

Spanish, Bamboo, 377 

Squamish, 288, 366 

Sranan, 377 

бге, 336 

Stieng, 335 

Straits, Northern, 366 

Su, 312 

Sua, 101, 304 

Suamia, 314 

Suau, 348 

Subanon, Siocon, 339 

Subia, 315 

Subtiaba, 208, 369 

биеі, 335 

Suena, 356 

Sui, 337 

Suki, 355 

Suku, 314 

Sukuma, 95, 314 

Sukur, 321 

Sukurum, 348 

Sula, 345 

Sulamu, 344 

Sulka, 361 

Sumambu, 342 

Sumariup, 360 

Sumau, 356 
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Sumbanese, East, 346 

Sumbawa, 344 

Sumbwa, 314 

Sumchu, 331 

Sumerian, 377 

Sumo, 371 

Sundanese, 161, 288, 344 

Sungam, 331 

Sungor, 317 

Sunwar, 332 

Suoy, 336 

Suppire, 304 

Sug, 336 

Sura, 321 

Suri, 317 

Surigaonon, 341 

Sursurunga, 349 

Surubu, 309 

Surui, 373 

Susguehannock, 233, 367 

Susu, 302 

Suwawa, 342 

Suya, 377 

Svan, 324 

Swahili, 95, 286, 314 

Swahili, Asian, 377 

Swaka, 315 

Swati, 316 

Swedish, 37, бо, 2771, 287, 
327 

Syan, 314 

Syriac, 323 


T'ien-pa, 337 
Та-Оу, 335 
Taabwa, 315 
Tabar, 349 

Tabaru, 359 
Tabasaran, 324 
Тасапа, 289, 376 
Tadkon, 312 

Tado, 371 
Tadyawan, 339 
Tagabawa, 340 
ТараКаоіо, 341 
Tagalog, 161, 164, 288, 340 
Tagbana, 304 
Tagbanwa, 340 
Tagbu, 306 

Tagen, 331 

Tagoi, 302 

Tagula, 348 
Tahitian, 161, 288, 353 
Tahltan, 199, 365 
Tai-Loi, 149, 335 
Tai, Black, 149, 338 
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Tai, Red, 149, 338 
Tai, White, 149, 338 
Taikat, 358 

Таіпо, 240, 373 
Tairora, 354 

Taita, 314 

Tajiki, 326 
Tajuason, 304 
Takelma, 208, 214, 217f, 367 
Takia, 347 

ТаКиа, 336 
Takunape, 373 
Takuu, 353 

Tal, 321 

Talaud, 342 

Tali, 331 

Taliabo, 345 

Talise, 351 

Talne, 305 

Talysh, 326 

Tama (EAST SUDANIC), 


317 
Tama (ТОСАМОАМ), 372 
Tamagario, 356 
Tamahaq, 320 
Tamalu, 333 
Taman, 333 
Tamanaco, 375 
Tamang, 331 
Tamari, 305 
Tamasheq, 320 
Tamazheq, 320 
Tamazight, 86, 95, 286, 320 
Tambanua, 342 
Tambotalo, 352 
Tamezret, 320 
Tami, 348 
Tamil, 138, 287, 330 
Tampuan, 336 
Tampulma, 305 
Tanacross, 365 
Tanah Merah, (MAIRASI- 
ТАМАН МЕКАН), 356 
Tanah Merah, (ЅЕМТАМІ), 
356 
Tanaina, 197, 365 
Тапапа, 365 
Тапапа, Upper, 365 
Тапа, 347 
Tangale, 321 
Tangga, 349 
Tangkhul, 332 
Tanglapui, 359 
Tangoa, 352 
Тапрва, 332 
Tangu, 361 


Tanguat, 361 

Тапі, 357 

Тапіппа, 350 

Tanimbili, 350 

Tanimuca, 372 

Tanjong, 343 

Tanna, North, 350 

Tanna, Southwest, 350 

Tao, 357 

Taogwe, 358 

Taokas, 338 

Taori-Kei, 358 

Taori-So, 358 

Taos, 370 

Tapiete, 373 

Tapii, 376 

Tapirape, 373 

Tarahumara, 370 

Taram, 306 

Tarangan, 346 

Tarao, 332 

Tarascan, 191, 208, 215—20 
passim, 230, 238, 289, 371 

Tareng, 335 

Tariano, 374 

Tarof, 358 

Tarok, 310 

Tarpia, 347 

Taruma, 220, 240, 372 

Tasaday, 340 

Tasmanian, 288, 354 

Tasmate, 352 

Tatana, 342 

Tatar, 127, 287, 328 

Tatar, Crimean, 328 

Tate, 358 

Tati, 326 

Tatuyo, 372 

Tauade, 356 

Taubuid, 278, 340 

Taujjut, 520 

Taulil, 361 

Taupota, 348 

Taura, 309 

Taurap, 179, 377 

Taushiro, 375 

Tausug, 341 

Tauya, 357 

Taware, 372 

Taworta, 358 

Tay, 338 

Tay Nua, 338 

Tayari, 305 

Tboli, 278, 541 

Tebilung, 342 

Teco, 369 
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Tegali, 302 

Tegem, 302 

Tehit, 359 

Tehuelche, 372 

Teke, Central, 313 

Teke, East, 315 

Teke, North, 313 

Teke, Northeast, 313 
Teke, South, 313 

Teke, West, 313 

Telefol, 355 

Telugu, 138—40, 287, 330 
Teluti, 346 

Tem, 305 

Tembe, 373 

Tembo (BANTU, ZONE J), 


314 

Tembo (BANTU, ZONE M), 
315 

Teme, 3o6 

Temiar, 337 

Temne, 304 

Temoq, 337 

Temuan, 344 

Tengsa, 332 

Tenharim, 373 

Tenis, 549 

Teop, 350 

Teor, 545 

Tep, 311 

Tepecano, 570 

Tepehua, 368 

Tepehuan, Northern, 370 

Tepehuan, Southern, 370 

Tequistlatec, 209, 213, 289, 
369 

Tera, 320 

Terena, 374 

Teribe, 371 

Terik, 318 

Ternate, 359 

Teso, 318 

Tetela, 313 

Tetete, 372 

Tetun, 346 

Teun, 346 

Teushen, 372 

Tewa (PANTAR), 359 

Tewa (ТАМОАМ), 236, 370 

Texixtepec, 368 

Tfalati, 367 

Thaayorre, 363 

Thado, 332 

Thai, 128, 338 

Thai, Northern, 149, 338 

Thai, Phu, 149, 338 
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Thai, Southern, 149, 288, 
338 
Thakali, 331 
Thami, 331 
Тһао, 338 
Тһар, 335 
ТһагаКа, 314 
Thargari, 364 
Thavung, 335 
Thawa, 363 
Thayetmo, 333 
Тһаурап, 363 
Тһе, 336 
Тһерог, 331 
Then, 337 
Tho, 338 
Thochu, 332 
Thompson, 366 
Thongho, 333 
Thukumi, 332 
Thulung, 332 
Thuri, 517 
Tiang, 349 
Tibetan, Central, 145, 287, 


331 
Tibetan, Classical, 145, 287, 


331 

Tibetan, Western, 143, 287, 
331 

Ticuna, 289, 372 

Tidikelt, 320 

Tidong, 342 

Tidore, 359 

Tiefo, 304 

Tien-chow, 337 

Tien-pao, 338 

liene, 313 

Tifal, 355 

Tigak, 349 

Tigre, 323 

Tigrinya, 86, 286, 323 

Tigwa, 340 

Tikar, 512 

Tikopia, 353 

Tikuu, 314 

Tillamook, 232, 366 

Tima, 302 

Timbe, 354 

Timor Pidgin, 377 

Timorese, 346 

Timote, 372 

Timucua, 191, 205-8 pas- 
sim, 214-18, 230, 238-39, 
371 

Timugon, 342 

Tina, 339 


Tindi, 324 

Tingal, 302 

Tinigua, 373 

Tinputs, 350 

Tirahi, 525 

Tiri, 351 

Tirio, 358 

Tiro, 302 

Tiruray, 341 

Tita, 309 

Titan, 350 

Tiv, 95, 311 

Tiwi, 188, 362 

Tjungundji, 363 

Tjurruru, 364 

Tlamelula, 369 

Tlapanec, 213, 234, 237, 
289, 369 

Tlemcen, 320 

Tlingit, 195, 197-99, 208-18 
passim, 223, 228, 365 

Tlongsai, 335 

Tmagurt, 320 

To'abaita, 351 

Toaripi, 358 

Toba, 289, 375 

Toba Batak, 344 

Tobelo, 359 

Tobo, 354 

Tocharian A, 55, 286, 325 

Tocharian B, 55, 286, 325 

Tocho, 302 

Toda, 138, 330 

Tofamna, 354 

Toga, 352 

Togoyo, 306 

Tojolabal, 368 

Ток Pisin, 377 

Tolai, 349 

Tolitoli, 342 

Tolomako, 352 

Tolowa, 365 

Tombulu, 342 

Tomini, 342 

Tomu, 355 

Tonda, 358 

Tondano, 342 

Tong, 335 

Tonga (BANTU, ZONE M), 
315 

Tonga (BANTU, ZONE N), 
315 

Tonga (BANTU, ZONE S), 
316 

Tonga-Inhambane, 316 

Tongan, 352 
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Tongareva, 353 
Tongian, (SAMA-BAJAW), 


344 
Tongian;(SAMA-BAJAW), 
344 
Tongwe, 314 
Tonkawa, 208, 214, 217f, 369 
Tonsawang, 342 
Tonsea, 342 
Tontemboan, 342 
Topotha, 318 
Tora, 373 
Toram, 320 
Torau, 550 
Tori Aikwakai, 358 
Toromono, 376 
Torres Strait Pidgin, 377 
Torwali, 325 
Tosu, 333 
Totela, 315 
Toto, 331 
Totonac, 204, 368 
Totoro, 371 
Towa, 236, 370 
Towei, 358 
Tregami, 325 
Tri, 335 
Trinitario, 374 
Trio, 375 
Triometesen, 375 
Tripuri, 332 
Trique, 570 
Trukese, 351 
Trumai, 240, 372 
Trung, 333 
Ts'un-lao, 338 
Tsaangi, 313 
Tsagu, 322 
Tsaiwa, 333 
Tsakonian, 37, 326 
Tsamay, 322 
Tsaxur, 324 
Tsetsaut, 199, 365 
Tsimshian, 207, 214, 217f, 
235, 367 
Tsogo, 313 
Tsonga, 95, 316 
Tsou, 338 
Tswa, 316 
Tswana, 76, 95, 286, 316 
Tuat, 320 
Tubar, 237, 370 
Tübatulabal, 236f, 370 
Tubeta, 314 
Tubetube, 348 
Tubu, 317 
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Тасапо, 289, 372 

Tucundiapa, 372 

Tudza, 333 

Таригі, 320 

Tukang-Besi, 342 

ТаКеп, 318 

Tukiumu, 333 

Tukpa, 331 

Tukudede, 346 

Tukum, 302 

Tukumanfed, 373 

Tula, 305 

Tulishi, 302 

Tulu, 138-41, 330 

Tumak, 320 

Tumbuka, 315 

Tumleo, 347 

Tumma, 302 

Tumtum, 302 

Tumuip, 349 

Tunebo, Central, 371 

Tunebo, Eastern, 371 

Tunebo, Western, 371 

Tunen, 312 

Tunica, 208, 213, 218, 368 

Tunjung, 343 

Tunya, 306 

Tupari, 373 

Tupuri, 306 

Tura, 303 

Turaka, 356 

Turi, 334 

Turkana, 318 

Turkic, Old, 328 

Turkish, 127, 287, 328 

Turkish, Crimean, 328 

Turkmen, 127, 287, 328 

Turkwam, 310 

Turrubal, 363 

Turu, 358 

Turum, 302 

Turunggare, 359 

Tuscarora, 205, 228, 233, 
307 

Tusha, 240, 372 

Tushan, 337 

Tushinawa, 376 

Tusya, 304 

Tutchone, Northern, 365 

Tutchone, Southern, 365 

Tutelo, 233, 367 

Tutoh Kenyah, 343 

Tutong, (BISAYAN), 342 

Tutong, (REJANG- 
BARAM), 343 

Tutuba, 352 


Tuva, 328 
Tuvalu, 353 
Tuwari, 360 
Tuyuca, 372 
Twampa, 319 
Twana, 366 
Tyeliri, 304 
Tyurama, 305 
Tzeltal, 368 
Tzotzil, 204, 368 
Tzutujil, 369 


Uaripi, 358 
Ubaghara, 309 
Ubang, 310 

Ubir, 348 

Ubyx, 324 

Udihe, 329 
Udmurt, 328 
Ufim, 354. 
Ugaritic, 323 
Ughele, 350 
Ugong, 333 
Uhami, 309 
Uhunduni, 356 
Uighur, 127, 287, 328 
Uighur, Yellow, 329 
Ujir, 346 

Ukpe, 310 

Ukpet, 309 
Ukrainian, 38, 287, 327 
Ukue, 309 
Ukuriguma, 357 
Ukwuani, 309 
Ulau, 347 

Ulingan, 357 
Ulithian, 351 

Ulu, 335 

Uman, 239, 572 
Umbindhamu, 363 
Umbrian, 326 
Umbugarla, 188, 362 
Umbule, 332 
Umbuygamu, 363 
Umiray Dumaget, 338 
Umon, 309 
Umotina, 376 
Umpila, 363 
Umpqua, 365 
Unami, 366 
Unank, 348 
Uneme, 309 
Ungarinjin, 362 
Unquachog, 366 
Unserdeutsch, 377 
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Unua, 352 

Ura (CARA), 310 
Ura (OCEANIC), 350 
Urak Lawoi', 344 
Urarina, 371 
Urat, 359 

Urdu, 38, 286, 325 
Urhobo, 309 
Uria, 358 
Urigina, 356 
Urii, 354 

Urim, 178, 359 
Urimo, 360 
Uripiv, 352 

Uru, 373 
Uruangnirin, 346 
Uruava, 350 
Urubu, 373 
Uruku, 373 
Urukuyana, 375 
Urumi, 373 
Urunamacan, 373 
Urupa, 373 
Usino, 356 
Usku, 354 
Uspantec, 369 
Usu, 357 

Utoro, 302 

Utu, 357 
Utugwang, 310 
Uvbie, 309 
Uvea, East, 353 
Uvea, West, 353 
Uvol, 349 

Uzbek, 127, 287, 328 
Uzekwe, 309 


Vafsi, 326 
Vagala, 305 
Vaghua, 350 
Vai, 303 

Vale, 318 
Valman, 175, 359 
Valpei, 352 
Van Kiéu, 335 
Vandalic, 327 
Vangunu, 350 
Vanimo, 359 
Vano, 350 
Vao, 352 
Varisi, 350 
Vatrata, 352 
Vayu, 331 
Vedda, 325 
Vehes, 348 
Vejoz, 375 
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Venda, 316 
Vepsian, 328 
Уеге, 306 
Vidunda, 314 
Viemo, 304 
Vietnamese, 149, 287, 
335 
Vilela, 375 
Vinmavis, 352 
Vinza, 314 
Vira, 314 
Virac, 341 
Уігі, 306 
Vitu, 349 
Votic, 328 
Vovo, 352 
Vuite, 332 
Vunapu, 352 
Vute, 311 


Wa, 335 
Waanyi, 362 
Wab, 347 
Wabo, 347 
Wabuda, 358 
Wabui, 375 
Wadaginam, 357 
Wadapi-Laut, 347 
Wadjalang, 363 
Wadjari, 364 
Wadjiginy, 362 
Waffa, 354 
Waga, 363 
Wagawaga, 348 
Wagaya, 364 
Wagiman, 362 
Wahgi, 355 
Waia, 358 
Waibuk, 360 
Waicuri, 215, 369 
Waigeo, 347 
Wailaki, 365 
Wailapa, 352 
Waima’a, 346 
Waimiri, 375 
Waina, 358 
Wainamari, 374 
Wainuma, 374 
Waiwai, 375 
Waja, 305 
Wajamli, 347 
Waka, 306 
Wakasihu, 345 
Wakhi, 325 
Walangama, 363 
Walapai, 369 


Waling, 332 
Walio, 360 
Wallace, 344 
Walmajarri, 364 
Wama (CARIB), 375 
Wama (GUR), 305 
Wamas, 357 
Wambaya, 362 
Wambon, 355 
Wampar, 348 
Wampur, 348 
Wanambre, 357 
Wancho, 332 
Wandabong, 354 
Wandamen, 347 
Wanggom, 355 
Wangkumara, 364 
Wanji, 314 
Wano, 356 
Wantoat, 354 
Wanukaka, 346 
Wanuma, 357 
Wanyam, 373 
Waorani, 371 
Wapi, 360 
Wapishana, 374 
Wappo, 230, 235, 368 
Wara, 304 
Waraicu, 374 
Warao, 238f, 289, 371 
Warapu, 359 
Waray (AUSTRALIAN), 188, 
362 
Waray (MESO-PHILIP- 
PINE), 341 
Wardaman, 362 
Warenbori, 179, 377 
Wares, 358 
Wargla, 320 
Waris, 358 
Wariyangga, 364 
Warji, 322 
Warkay, 355 
Warlang, 362 
Warlmanpa, 364 
Warlpiri, 364 
Warluwara, 364 
Warnang, 302 
Warndarang, 362 
Warnman, 364 
Waropen, 347 
Warrgamay, 363 
Warumungu, 362 
Waruna, 355 
Warwa, 362 
Wasembo, 357 
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Washo, 211-18 passim, 234, 
369 
Wasi, 361 
Wasi-weri, 325 
Waskia, 357 
Watakataui, 360 
Wataluma, 348 
Watam, 361 
Wathawurung, 363 
Watubela, 345 
Waunana, 371 
Waura, 374 
Wayana, 375 
Wayoro, 373 
Wayumara, 375 
Weda, 347 
Wedau, 348 
Weh, 312 
Weining, 333 
Weleki, 354 
Welsh, 30, 33, 37, 287, 327 
Wemale, 345 
Wemba Wemba, 363 
Weretai, 358 
Weri, 356 
Wescos, 377 
Western Desert Language, 
184, 288, 364 
Wetamut, 352 
Wetar, 346 
Weyewa, 346 
Whitesands, 350 
Wiaki, 359 
Wichita, 205, 232, 288, 367 
Wik-liyanh, 363 
Wik-Me'anha, 363 
Wik-Mungkan, 363 
Wik-Ngathana, 363 
Winnebago, 367 
Wintun, 368 
Winye, 305 
Wiradhuri, 363 
Wirafed, 373 
Wirangu, 364 
Wirdimay, 364 
Wirina, 374 
Wiru, 354 
Wiyot, 208-17 passim, 222, 
228, 231, 366 
Wobe, 304 
Woccon, 367 
Wodani, 356 
Wogamusin, 360 
Wogeo, 347 
Woi, 347 
Woisika, 359 
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Wolio, 342. 

Wolof, 95, 303 
Wom(ADAMAWA), 306 
Wom (TORRICELLI), 359 
Wori, 312 

Worora, 362 
Wotapuri, 325 
Wotu, 342 

Wu, 143, 331 
Wu-ming, 337 
Wudhadhi, 363 
Wulguru, 363 
Wuliwuli, 363 
Wulna, 362 
Wumvu, 313 
Wunambal, 362 
Wusi, 352 
Wutung, 359 
Wuumu, 313 
Wuvulu, 349 
Wuywurrung, 363 


/Хатп, 301 

Хатпіг, 322 

Xamta, 322 

Kanty, 65, 328 

Karacuu, 351 

Xaragure, 351 

Xarua, 349 

Xasonke, 303 

ІГХеКугі, 301 

Xhosa, 76, 95, 286, 316 

Kiang, 143, 331 

Kinalug, 324 

Хіпса, 208, 213, 215, 219, 
238, 371 

Хігі, 301 

Kokleng, 376 

KU, 301 

Kvarshi, 324 


Ya, 338 
Yaaku, 322 
Yabaana, 374 
Yabarana, 374 
Yaben, 357 
Yabim, 348 
Yabio, 360 
Yabiyufa, 354 
Yabong, 356 
Yabula-Yabula, 363 
Yabuti, 376 
Yafi, 358 
Yagawak, 354 
Yaghnobi, 325 
Yagomi, 354 


Yagua, 374 

Yagwoia, 355 

Yahadian, 358 

Yahang, 359 

Yahi, 369 

Yahup, 372 

Yaka (BANTU, ZONE A), 
312 

Yaka (BANTU, ZONE H), 
314 

Yakan, 343 

Yakha, 332 

Yaknge, 354 

Yakoma, 306 

Yakpe, 307 

Yakut, 328 

Yala, 308 

Yalarnnga, 364 

Yalayu, 351 

Yalu, 348 

Yalunka, 302 

Yamalele, 349 

Yamaluba, 376 

Yamamadi, 373 

Yamana, 278, 372 

Yamap, 348 

Yamba, 311 

Yambasa, 312 

Yambes, 359 

Yamdena, 346 

Yameo, 374 

Yami, 339 

Yaminahua, 376 

Yan-nhangu, 362 

Yana, 369 

Yanda, 363 

Yandruwandha, 364 

Yangman, 362 

Yangulam, 357 

Yano, 331 

Yanomami, 289, 371 

Yanomamó, 371 

Yanyuwa, 188, 362 

Yanzi, 313 

Yao (BANTU), 315 

Yao (CARIB), 375 

Yao (MIAO-YAO)—Mien, 
148, 154, 287, 334 

Yapese, 166ff, 338 

Yapunda, 359 

Yaqay, 355 

Yaqui, 370 

Yarawata, 357 

Yardliwarra, 364 

Yareba, 356 

Yari, 377 
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Yarluyandi, 364 

Yaruma, 375 

Yaruro, 373 

Yasa, 312 

Yashi, 310 

Yau (FINISTERRE), 354 

Yau (TORRICELLI), 359 

Yauapery, 375 

Yaul, 360 

Yaur, 347 

Yava, 359 

Yavapai, 369 

Yavitero, 374 

Yawalapiti, 374 

Yawuru, 362 

Yay, 337 

Yaygir, 363 

Yazdi, 326 

Yazgulami, 326 

Ye'cuana, 375 

Yeci, 315 

Yega, 356 

Yekora, 356 

Yela, 513 

Yele, 361 

Yellow Uighur, 329 

Yelmek, 358 

Yendang, 306 

Yerakai, 360 

Yeretuar, 347 

Yeskwa, 310 

Yeu, 335 

Yey, 358 

Yiddish, 37, 60, 287, 327 

Yidgha, 325 

Yidiny, 363 

Yil, 359 

Yilba, 363 

Yiman, 363 

Yimas, 360 

Yimchungru, 332 

Yinbaw, 331 

Yindjibarndi, 364 

Yindjilandji, 564 

Yinggarda, 364 

Yiningay, 363 

Yinwum, 363 

Yir Yoront, 365 

Yirandhali, 565 

Yis, 359 

Yitha-Yitha, 364 

Yoba, 349 

Yogad, 538 

Yoidik, 357 

Yokuts, Foothill North, 235, 
368 


Language Index 


Yokuts, Foothill South, 235, 
208 

Yokuts, Valley, 368 

Yom, 305 

Yombe, 314 

Yonggom, 355 

Yonkalla, 367 

Yoruba, 95, 97, 286, 308 

Yotafa, 347 

Yotayota, 363 

Yuaga, 351 

Yuan, 338 

Yuberi, 373 

Yucatec, 204, 368 

Yuchi, 208, 213-18, 288, 
367 

Yucuna, 374 

Yue, 287, 331 

Yugambal, 363 

Yukaghir, 64, 69, 259, 328 

Yuki, 208-16 passim, 230, 
235, 368 

Yukpa, 375 

Yukuben, 309 

Үшін, 318 

Үшпа (САКІВ), 375 

Yuma (YUMAN), 209, 289, 
369 


Yumana, 374 

Yumbri, 335 

Yung-chu’un, 338 

Yungur, 306 

Yunnan Shant'ou, 338 

Yupik, Central, 194-95, 329 

Yupngayth, 363 

Yuqui, 373 

Yuracare, 240, 372 

Yuri (MACRO-TUCA- 
NOAN), 372 

Yuri (UNCLASSIFIED), 
179, 377 

Yurimangui, 230, 234, 369 

Yurok, 208-17 passim, 222, 
228, 231, 366 

Yuruna, 373 

Yuruti, 372 

Yuwana, 377 

Yuyu, 364 


Zaar, 322 

Zagawa, 317 

Zahao, 333 

Zanaki, 314 

Zande, 307 

Zaparo, 371 

Zapotec, 191, 204, 289, 371 
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Zaramo, 314 

Zayse, 322 

Zaza, 326 

Zazao, 350 

Zeem, 322 

Zeme, 332 

Zenaga, 93, 320 
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volume in particular assumes no background in linguistics. A postscript 
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